Regulatory zinc finger proteins

ABSTRACT

Disclosed are chimeric zinc finger proteins that can regulate endogenous genes. Examples of such proteins include proteins that can regulate VEGF-A expression. The proteins and nucleic acid encoding them can be used to modulate angiogenesis.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Application Ser. No.60/431,892, filed on Dec. 9, 2002, the contents of which areincorporated by reference herein.

TECHNICAL FIELD

This invention relates to DNA-binding proteins such as transcriptionfactors.

BACKGROUND

Most genes are regulated at the transcriptional level by polypeptidetranscription factors that bind to specific DNA sites within in thegene, typically in promoter or enhancer regions. These proteins activateor repress transcriptional initiation by RNA polymerase at the promoter,thereby regulating expression of the target gene. Many transcriptionfactors, both activators and repressors, are modular in structure. Suchmodules can fold as structurally distinct domains and have specificfunctions, such as DNA binding, dimerization, or interaction with thetranscriptional machinery. Effector domains such as activation domainsor repression domains retain their function when transferred toDNA-binding domains of heterologous transcription factors. Brent andPtashne (1985) Cell 43:729-36; Dawson et al., (1995) Mol. Cell Biol.15:6923-31. The three-dimensional structures of many DNA-bindingdomains, including zinc finger domains, homeodomains, andhelix-turn-helix domains, have been determined from NMR and X-raycrystallographic data.

Zinc finger domains are one type of structural domain that is modular infunction. Zinc finger proteins (ZFPs) can be used to regulatetranscription. For example, Kim and Pabo demonstrated that the Zif268protein efficiently repressed VP16-activated transcription of a targetgene when the Zif268 protein was bound near the transcription start siteof a target gene. Kim and Pabo (1997) J Biol Chem 272:29795-29800. Liuet al. describe up-regulating VEGF-A using engineered zinc fingerproteins constructed by site-specific mutagenesis. Liu et al. (2001). J.Biol. Chem. 276, 11323-11334.

SUMMARY

In one aspect, the invention features a polypeptide that includes a DNAbinding domain and can regulate expression of a gene in a cell, e.g., aneukaryotic cell. In one embodiment, the polypeptide binds to a targetDNA site in the gene. The DNA binding domain typically includes at leastthree zinc finger domains.

In one embodiment, at least one, two, or three of the zinc fingerdomains are naturally-occurring zinc finger domains. For example, thesedomains can be identical to zinc finger domains of different naturallyoccurring proteins, or identical to non-adjacent zinc finger domainsfrom a naturally occurring protein. All the zinc finger domains can benaturally-occurring.

In another embodiment, at least one, two, or three of the zinc fingerdomains is a variant of a naturally-occurring zinc finger domain, e.g.,a domain that differs by between one and four or two and five amino acidresidues. The polypeptide may include a combination ofnaturally-occurring zinc finger domains and variant domains.

The polypeptide may regulate any gene. Regulation can be direct such aswhen the polypeptide interacts with a target site in the target gene.For example, the gene can be an endogenous gene of a cell (e.g., a genepresent in a natural genome), a heterologous gene (e.g., a transgene) ora viral gene. In one embodiment, the endogenous gene encodes a secretedpolypeptide or a polypeptide that participates in or regulatesproduction of a secreted factor, e.g., a secreted polypeptide. In oneembodiment, the endogenous gene regulates cell proliferation, cellmigration, or tissue morphogenesis (e.g., angiogenesis).

In one embodiment, the endogenous gene encodes a polypeptide thatregulates hormone synthesis, a hormone, or growth factor. Exemplarygrowth factors include the VEGF family of growth factors.

VEGF-A is one member of this family. In one embodiment, the polypeptiderecognizes a target site in the regulatory region of the VEGF-A gene,e.g., a site located between −950 and +450 of the VEGF-A gene. Forexample, the polypeptide can recognize a site that is located at

about −680, −677, −671, −668, −665, −633R, −632R, −631, −630, −606,−603, −554, −536, −495, −475, −468, −465, 462, −455, −395R, −394R,−393R, −392, −391R, −385R, −382R, −358R, −314R, −282, −206, −206, −203,−184, −181, −137, −124, −90R, −85, −30, 77, 244R, 283R, 342, 357, 366,434, 435, or 474R of the human VEGF-A promoter, or a site within 60, 50,20, 10, 5, or 3 nucleotides of such sites. These nucleotide positionsindicate the 5′ most nucleotide of the site from transcriptionalinitiation site on the upper strand of the promoter, unless the letter“R appears, in which case, the numbering of those positions (with the Rdesignation) indicates the 5′ most nucleotide site on the lower(reverse) strand. For example, F435(−90R) target sequence is 5′:−90 to3′:−98 on reverse strand (5′: −98 to 3′:−90 position fromtranscriptional initiation site on upper strand). In one embodiment, thepolypeptide competes with a polypeptide having a sequence describedherein for binding to its target site in the VEGF-A gene.

In one embodiment, the target site is in a regulatory region of theendogenous gene overlaps with a DNase hypersensitive site, or overlapswith the binding site of an endogenous transcription factor. In anotherembodiment, the target site is within 700, 500, 300, 200, 50, 20, 10, 5,or 3 basepairs of such a site or region. In one embodiment, thepolypeptide binds to the target site with a dissociation constant of nomore than 20, 7, 5, 3, 2, 1, 0.5, or 0.05 nM.

In one embodiment, when the polypeptide is in a cell, it is able toalter transcription (e.g., represses or activates) of the endogenousgene at least 1.25, 1.5, 1.7, 1.9, 2.0, 2.5, 5, 10, 20, 50, or 100 fold.The polypeptide may have a similar effect when in a cell in an organism.

In one embodiment, the DNA binding domain includes at least two zincfinger domains listed in a single row of Table 1, Table 2, Table 3,Table 4, or Table 5 or includes at least two zinc finger domains thathave identical DNA contacting residues as two zinc finger domains listedin a single row of Table 1, Table 2, Table 3, Table 4, or Table 5.

The polypeptide can further include a transcriptional activation orrepression domain. The polypeptide can further include a celltransduction domain, e.g., the HIV tat transduction domain.

In one embodiment, the polypeptide suppresses induction of VEGF-Aproduction by hypoxia in a mammalian cell. The suppression can be, e.g.,such that VEGF-A levels are less than 80, 70, 60, 50, 40, 30, 20, 10, 5,3, 2, 1, or 0.1% of the protein level induced by hypoxia in an otherwiseidentical cell that lacks the polypeptide

The invention also provides a nucleic acid that includes a sequence thatencodes a polypeptide described herein and a cell (e.g., a prokaryoticor eukaryotic, e.g., mammalian cell) that includes the nucleic acid. Thecell can express the nucleic acid and produce the polypeptide. In oneembodiment, the cell is cultured in vitro. The cell can beimmuno-isolated or encapsulated. The invention also provides an organismthat includes one or more cells in which the polypeptide is produced andan endogenous gene is regulated by the polypeptide.

In another aspect, the invention features a method of regulating anendogenous gene, the method including: providing a cell that includes acoding nucleic acid encoding an artificial polypeptide that includes atleast three zinc finger domains, wherein the polypeptide binds to atarget DNA site in an endogenous gene; and expressing the coding nucleicacid in the cell under conditions in which the artificial polypeptide isproduced, binds to the target DNA site, and regulates the endogenousgene. In one embodiment, at least two of the zinc finger domains arenaturally-occurring zinc finger domains. For example, the two zincfinger domains can be identical to zinc finger domains of differentnaturally occurring proteins, or can be non-adjacent zinc finger domainsfrom the same naturally occurring protein. In one embodiment, theartificial polypeptide includes a transcriptional activation orrepression domain. The endogenous gene can be repressed or activated. Inone embodiment, the cell is provided by contacting the cell with anucleic acid delivery vehicle, e.g., a liposome, virus, or viralparticle. In one embodiment, the cell is a cell within an organism,e.g., a mammalian organism. The method can further include, prior to theexpressing, introducing the cell into a subject organism, orencapsulating the cell and introducing the encapsulated cell into asubject organism.

Exemplary polypeptides can include at least two or more zinc fingerdomains, e.g., two, three or four zinc finger domain in a particular rowof a table below: TABLE 1 Exemplary VEGF-A Binding Proteins (A) NameMotifs (Col. 2) Specific Domains (Col. 3) F475 mQSHR-mRDHT-mRSNRQSHR2-RDHT-RSNR F121 mQSHT-mRSHR-mRDHT QSHT-RSHR-RDHT F435mQSHR-mRDHT-mRSHR QSHR2-RDHT-RSHR F547 mRSHR-mRDHT-mVSNV RSHR-RDHT-VSNVF2825 mQSHV-mRDHR-mRDHT QSHV-RDHR1-RDHT

TABLE 2 Exemplary VEGF-A Binding Proteins (B) Name Motifs (Col. 2)Specific Domains (Col. 3) F480 mRSHR-mRDHT-mRSHR RSHR-RDHT-RSHR F2828mCSNR-mWSNR-mRDHR CSNR1-WSNR-RDHR1 F625 mCSNR-mWSNR-mRSHRCSNR1-WSNR-RSHR F2830 mDSNR-mWSNR-mRDHR DSNRa-WSNR-RDHR1 F2838mDSNR-mWSNR-mRSHR DSNRa-WSNR-RSHR

TABLE 3 Exemplary VEGF-A Binding Proteins (C) Name Motifs (Col. 2)Specific Domains (Col. 3) F109 mRDER-mQSSR-mQSHT-mRSNRRDER1-QSSR1-QSHT-RSNR F2604 mDSAR-mRSNR-mRDHT-mVSSR DSAR2-RSNR-RDHT-VSSRF2605 mQSHT-mDSAR-mRSNR-mRDHT QSHT-DSAR2-RSNR-RDHT F2607mRDHT-mVSNV-mQSHT-mDSAR RDHT-VSNV-QSHT-DSAR2 F2615mRSHR-mDSCR-mQSHT-mDSCR RSHR-DSCR-QSHT-DSCR F2633mQSNR-mQSHR-mRDHT-mRSNR QSNR3-QSHR2-RDHT-RSNR F2634mCSNR-mRDHT-mRSNR-mRSHR CSNR1-RDHT-RSNR-RSHR F2636mRSHR-mQSHT-mRSHR-mRDER RSHR-QSHT-RSHR-RDER1 F2644mQSNR-mRSHR-mQSSR-mRSHR QSNR3-RSHR-QSSR1-RSHR F2646mQSHT-mDSCR-mRDHT-mCSNR QSHT-DSCR-RDHT-CSNR1 F2650mQSHT-mWSNR-mRSHR-mWSNR QSHT-WSNR-RSHR-WSNR F2679mVSNV-mRSHR-mRDER-mQSNV VSNV-RSHR-RDER1-QSNV2

TABLE 4 Exemplary VEGF-A Binding Proteins (D) Name Motifs (Col. 2)Specific Domains (Col. 3) F2610 mRSNR-mRSHR-mRDHT- RSNR-RSHR-RDHT-RSHRmRSHR F2612 mRSHR-mRDHT-mRSHR- RSHR-RDHT-RSHR-RDHT mRDHT F2638mRSNR-mQSHR-mRDHT- RSNR-QSHR2-RDHT-RSHR mRSHR

TABLE 5 Exemplary VEGF-A Binding Proteins (E) Name Motifs (Col. 2)Specific Domains (Col. 3) F2608 mRSHR-mRDHT-mVSNV-mQSHTRSHR-RDHT-VSNV-QSHT F2611 mRSHR-mRSHR-mWSNR-mRSHR RSHR-RSHR-WSNR-RSHRF2617 mRDER-mRSHR-mDSCR-mQSHT RDER1-RSHR-DSCR-QSHT F2619mRSHR-mVSTR-mQSNR-mRDHT RSHR-VSTR-QSNR3-RDHT F2623mQSHT-mRSNR-mWSNR-mRDER QSHT-RSNR-WSNR-RDER1 F2625mQSHT-mWSNR-mRDHT-mRDER QSHT-WSNR-RDHT-RDER1 F2628mVSSR-mWSNR-mRSNR-mVSSR VSSR-WSNR-RSNR-VSSR F2629mQSHR-mVSSR-mWSNR-mRSNR QSHR2-VSSR-WSNR-RSNR F2630mRDER-mQSHR-mVSSR-mWSNR RDER1-QSHR2-VSSR-WSNR F2635mQSHR-mRSNR-mQSHR-mRDHT QSHR2-RSNR-QSHR2-RDHT F2637mRDHT-mRSNR-mRSHR-mWSNR RDHT-RSNR-RSHR-WSNR F2642mRDHT-mRSHR-mCSNR-mRDHT RDHT-RSHR-CSNR1-RDHT F2643mRSHR-mCSNR-mRDHT-mCSNR RSHR-CSNR1-RDHT-CSNR1 F2648mQSSR-mQSHR-mRSNR-mRSNR QSSR1-QSHR2-RSNR-RSNR F2651mVSTR-mQSHT-mWSNR-mRSHR VSTR-QSHT-WSNR-RSHR F2653mVSTR-mQSNR-mRSHR-mQSNR VSTR-QSNR3-RSHR-QSNR3 F2654mQSNR-mRSHR-mQSNR-mVSNV QSNR3-RSHR-QSNR3-VSNV F2662mDSCR-mRDHT-mVSTR-mRDER DSCR-RDHT-VSTR-RDER1 F2667mRSHR-mDSCR-mRDHT-mRSHR RSHR-DSCR-RDHT-RSHR F2668mRSHR-mRSHR-mQSNV-mQSNV RSHR-RSHR-QSNV2-QSNV2 F2673mRDHT-mVSSR-mRDER-mQSSR RDHT-VSSR-RDER1-QSSR1 F2682mRSNR-mQSSR-mQSNR-mRSHR RSNR-QSSR1-QSNR3-RSHR F2689mRSNR-mDSAR-mQSNR-mQSHT RSNR-DSAR2-QSNR3-QSHT F2697mRSHR-mCSNR-mQSHT-mRSNR RSHR-CSNR1-QSHT-RSNR F2699mRSNR-mQSHT-mDSAR-mRSHR RSNR-QSHT-DSAR2-RSHR F2703mQSHR-mRSHR-mRDER-mRSHR QSHR2-RSHR-RDER1-RSHR F2702mRSHR-mQSHR-mRSHR-mQSNV RSHR-QSHR2-RSHR-QSNV2

In one aspect, the invention features a polypeptide that includes a DNAbinding domain. The DNA binding domain has a plurality of zinc fingerdomains. The polypeptide can alter the expression or production ofVEGF-A in cells. For example, the polypeptide can alter the normalresponse of the cells to a signal that would increase or decrease VEGF-Aproduction or expression. In one embodiment, the polypeptide cansuppresses induction of VEGF-A production or expression in cells underconditions in which VEGF-A production or expression is induced. Forexample, the suppression can have a magnitude such that VEGF-A proteinor mRNA levels are less than 80, 70, 60, 50, 40, 30, 20, 10, 5, 3, 2, 1,or 0.5% of the level induced by the condition in otherwise identicalcell that lacks the polypeptide. In one embodiment, the conditionincludes hypoxia.

These conditions can be determined with particularity in human embryonickidney 293F cell, e.g., as described in the examples below.

The polypeptide can be used in a wide variety of implementations, e.g.,in a human cell in culture or in an organism, e.g., in a human ornon-human mammalian organism.

In one embodiment, the polypeptide binds to a site in the human VEGF-Agene. In another embodiment, the polypeptide functions indirectly, e.g.,it binds to a site in another gene.

In one embodiment, the polypeptide includes a repression domain. Thepolypeptide can include other features described herein. The inventionalso features a composition, e.g., a pharmaceutical composition thatincludes the polypeptide or a nucleic acid encoding the polypeptide.

The composition can be administered to a subject, e.g., in an amounteffective to reduce angiogenesis in the subject, e.g., in the vicinityof a lesion in the subject (e.g., a neoplasm) or throughout the subject.In one embodiment, the subject is a human that has or is suspected ofhaving a metastatic cancer.

With respect to any featured polypeptide, the polypeptide can furtherinclude a heterologous sequence, e.g., a nuclear localization signal, asmall molecular binding domain (e.g., a steroid binding domain), anepitope tag or purification handle, a catalytic domain (e.g., a nucleicacid modifying domain, a nucleic acid cleavage domain, or a DNA repaircatalytic domain), a transcriptional function domain (e.g., anactivation domain, a repression domain, and so forth), a proteintransduction domain (e.g., from HIV tat), and/or a regulatory site(e.g., a phosphorylation site, ubiquitination site, or protease cleavagesite).

The polypeptide can be formulated in a pharmaceutical composition, e.g.,with one or more additional components. The composition or polypeptidecan be included in a kit that also includes another agent orinstructions for use, e.g., therapeutic use.

The polypeptide can be attached (covalently or non-covalently) to asolid support, e.g., a bead, matrix, or planar array. The polypeptidecan also be attached to a label such as a radioactive compound, afluorescent compound, another detectable entity, or a component of adetection system (e.g., a chemiluminescent agent).

The invention also includes an isolated nucleic acid that includes asequence encoding one of the aforementioned polypeptides. The nucleicacid can further include an operably linked regulatory sequence, e.g., apromoter, a transcriptional enhancer, a 5′ untranslated region, a 3′untranslated region, a virus packaging sequence, and/or a selectablemarker. The nucleic acid can be packaged in a virus, e.g., a virus thatcan infect a mammalian cell, e.g., a lentivirus, retrovirus, pox virus,or adenovirus.

The invention further provides a cell that contains the polypeptide orthe nucleic acid that includes a sequence encoding the polypeptide. Thecell can be within a tissue in a subject organism or in culture. Thecell can be an animal (e.g., mammalian, e.g., a human or non-humancell), plant, or microbial (e.g., fungal or bacterial) cell. The cellcan be prepared by introducing the polypeptide into the cell or a parentcell or by introducing the nucleic acid into the cell or parent cell.The nucleic acid can be used to produce the polypeptide in the cell.

The invention still further includes a non-human transgenic mammal,e.g., a mouse, rat, pig, rabbit, cow, goat, or sheep. The geneticcomplement of the transgenic mammal includes the nucleic acid sequenceencoding the chimeric zinc finger polypeptide described above andelsewhere herein. The invention also includes method of producing thepolypeptide, e.g., by expressing the nucleic acid, and of using thepolypeptide, e.g., to regulate endogenous genes or viral genes in acell.

With respect to any polypeptide herein that regulates VEGF-A, thepolypeptide can be used in a method of regulating VEGF-A expression in acell. The method includes introducing the polypeptide or a nucleic acidthat includes a sequence encoding the polypeptide into a cell. Forexample, the polypeptide can be introducing using a liposome or byfusion to a protein transduction domain. A nucleic acid can beintroduction, e.g., by transfection or viral delivery.

The invention also features a composition, e.g., a pharmaceuticalcomposition that includes a polypeptide that regulates VEGF-A, e.g., asdescribed herein, or a nucleic acid encoding the polypeptide. In oneembodiment, the polypeptide can suppress VEGF-A expression and thecomposition can be administered to a subject, e.g., in an amounteffective to reduce angiogenesis in the subject, e.g., in the vicinityof a lesion in the subject (e.g., a neoplasm) or throughout the subject.In one embodiment, the subject is a human that has or is suspected ofhaving a metastatic cancer.

In another embodiment, the polypeptide can increase VEGF-A expression,and the composition is administered to a subject, in an amount effectiveto increase angiogenesis in the subject. For example, increasedangiogenesis can required for, e.g., vascular formation, embryonicdevelopment, somatic growth, differentiation of nerve system,maintenance of pregnancy, wound healing etc. The vascular endothelialgrowth factor (VEGF-A), one of endothelial cell specific growth factor,is s a key factor that regulates endothelial cell growth anddifferentiation.

Insufficient levels of VEGF or its VEGF₁₆₄ and VEGF₁₈₈ isoform lead topost-natal angiogenesis and ischemic heart disease. Activation of VEGF-Acan be used for the treatment or prevention of peripheral artery diseaseand coronary artery disease. For example, the subject can be a humanthat has or is suspected of having a wound (internal or external),pregnancy, a neurological problem, an embryonic developmental problem, acardiovascular disease (e.g., ischemic heart disease, peripheral arterydisease, or coronary artery disease).

At least 5 isoforms of VEGF-A protein are produced from different splicevariants. These isoforms have different effects on angiogenesis. Theactivation of VEGF-A by a zinc finger protein, e.g., a protein describedherein, may, in some implementations, enable upregulation of entire orparticular splice variants that are important for a desired clinicaloutcome. For example, the zinc finger protein may modulate expressionall splice variants, or it may modulate expression of a subset of splicevariants, e.g., at least one splice variant.

In another aspect, the invention features an encapsulated compositionthat includes an encapsulation layer composed of a biocompatiblematerial, and recombinant mammalian cells, wherein the cells contain anucleic acid comprising a sequence encoding a chimeric zinc fingerprotein that regulates production of a factor, e.g., secreted factor ora non-secreted protein, e.g., a cytoplasmic protein. In one embodiment,the biocompatible material is permeable at least to proteins having amolecular weight of 10, 20, 30, or 40 kDa. The biocompatible materialcan retain proteins larger than, e.g., 50, 100, 120, or 200 kDa.

The invention also provides a rapid and scalable cell-based method foridentifying and constructing chimeric proteins, e.g., transcriptionfactors. Such transcription factors can be used, for example, foraltering the expression of endogenous genes in biomedical andbioengineering applications. Activity of the transcription factors canbe assayed in vivo and in cultured cells, e.g., in intact, living cellsin culture.

In yet another aspect, the invention features a method of characterizinga chimeric zinc finger protein, e.g., a zinc finger protein describedherein. The method includes: introducing a nucleic acid that encodes theprotein into a cell; expressing the nucleic acid; and evaluatingexpression of a target gene. For example, the evaluating can includedetermining the profile of expression of endogenous genes in the cell.Such an expression profile includes a plurality of values, wherein eachvalue corresponds to the level of expression of a different gene,splice-variant or allelic variant of a gene (i.e., mRNA level) or theabundance of a translation product (i.e., protein level). The value canbe a qualitative or quantitative assessment of the level of expressionof the gene or the translation product of the gene, i.e., an assessmentof the abundance of 1) an mRNA transcribed from the gene, or 2) thepolypeptide encoded by the gene.

In yet another aspect, the invention features a method of identifying achimeric zinc finger protein that can bind to a particular target site.The method includes: providing data records, each record associating anidentifier for a naturally-occurring zinc finger domain (e.g., a humanzinc finger domain) and at least one 3- or 4-basepair subsite that isrecognized by the zinc finger domain referenced by the identifier;parsing the target site into at least two 3- or 4-basepair subsites; foreach of the subsites, retrieving a set of the identifiers from the datarecords, the set comprising identifiers for the zinc finger domains thatrecognize the subsite; and designing a polypeptide that comprises a zincfinger domain for each of the subsites, the zinc finger domain beingreferenced by an identifier from the set for the respective subsite.

The data records can include a record that identifies a zinc fingerdomain of interest. The method can further include the step ofsynthesizing a nucleic acid that encodes the polypeptide and/orsynthesizing the polypeptide in vitro. The method can also include thestep of assessing the binding of the polypeptide to the target site,e.g., using an in vitro binding assay or an in vivo assay such as anassay for target gene expression. The synthesized polypeptide canfurther include an activation or repression domain.

In one embodiment, the method further includes assessing the ability ofthe polypeptide to alter the expression of one or more endogenous genes.The assessing can include profiling the expression of multipleendogenous genes, e.g., using nucleic acid microarrays, or a single orlimited number of genes. The method can also further include contactingthe polypeptide with a DNA that includes the target site, e.g., invitro.

In another embodiment, the method further includes retrieving a nucleicacid encoding the polypeptide from an addressed library of nucleicacids, each nucleic acid of the library including a sequence encodingfirst and second zinc finger domains.

In another aspect, the invention features certain polypeptides andisolated nucleic acids. A polypeptide of the invention can include, forexample, one, two, three, or four zinc finger domains and be related toa reference polypeptide that has a particular amino acid sequenceprovided herein. For example, the polypeptide can have the sameDNA-contacting residues in one, two, three, four or more zinc fingerdomains as the DNA-contacting residues in respective zinc finger domainsof the reference polypeptide. In another example, in three zinc fingerdomains of the polypeptide, at least 9, 10, or 11 of the DNA-contactingresidues (3×4) are identical to the DNA-contacting residues ofrespective zinc finger domains in the reference polypeptide. In anotherexample, in four zinc finger domains of the polypeptide, at least 12,13, 14, or 15 of the DNA-contacting residues (4×4) are identical to theDNA-contacting residues of respective zinc finger domains in thereference polypeptide. The polypeptide can be able to bind to the samesite as the reference polypeptide, and regulate the same endogenousgene, e.g., within 0.1 to 10 or 0.5 to 1.5 fold of the activity of thereference polypeptide.

In one embodiment, the amino acid sequences of one or more (e.g., all)of the zinc finger domains are naturally occurring sequences. In oneembodiment, the polypeptide is able to regulate a target gene, e.g., anendogenous cellular gene, e.g., the same gene as the referencepolypeptide, e.g., VEGF-A.

In addition, purified polypeptides of the invention can have an aminoacid sequence at least 50%, 60%, 70%, 80%, 90%, 93%, 95%, 96%, 98%, 99%,or 100% identical to a zinc finger domain described herein. Thepolypeptides can be identical to a zinc finger domain described hereinat the amino acid positions corresponding to the DNA contacting residuesof the polypeptide. Alternatively, the polypeptides differ from a zincfinger domain described herein at least one of the residuescorresponding to the DNA contacting residues of the polypeptide. Forexample, one or more zinc finger domains in the polypeptides include aconservative substitution at a DNA contacting residue.

The polypeptides can also differ at least one, two, or three residues,e.g., residues other than a DNA contacting residue. For example, withina given zinc finger domain, the polypeptide may differ by a single aminoacid from the amino acid sequences referenced above, or by two, three,or four amino acids from the sequences referenced above. The differencemay be due to a conservative substitution as defined herein. In oneembodiment, the amino acids differences with respect to the sequencesreferenced above are located between the second zinc-coordinatingcysteine and the −1 DNA contacting position (referring to the numberingsystem for DNA contacting positions described below).

The comparison of sequences and determination of percent identitybetween two sequences can be accomplished using a mathematicalalgorithm. In particular, the percent identity between two amino acidsequences is determined using the Needleman and Wunsch ((1970) J. Mol.Biol. 48:444-453) algorithm which has been incorporated into the GAPprogram in the GCG software package, using a Blossum 62 scoring matrixwith a gap penalty of 12, a gap extend penalty of 4, and a frameshiftgap penalty of 5.

The purified polypeptides can also include one or more of the following:a heterologous DNA binding domain, a nuclear localization signal, asmall molecular binding domain (e.g., a steroid binding domain), anepitope tag or purification handle, a catalytic domain (e.g., a nucleicacid modifying domain, a nucleic acid cleavage domain, or a DNA repaircatalytic domain) and/or a transcriptional function domain (e.g., anactivation domain, a repression domain, and so forth). In oneembodiment, the polypeptide further includes a second zinc fingerdomain, e.g., a domain having a sequence described herein. For example,the polypeptide can include an array of zinc fingers that include two ormore zinc finger domains. In one embodiment, one or more of the domains(e.g., each domain) can have a sequence that conforms to a motifdescribed herein, e.g., mCSNR, mDSAR, mnDSCR, mISNR, mQFNR, mQSHV,mQSNI, mQSNK, mQSNR, mQSNV, mQSSR, mQTHQ, mQTHR, mRDER, mRDHT, mRDKR,mRSHR, mRSNR, mVSNV, mVSSR, mVSTR, mWSNR, mDGNV, mDSNR, and mRDNQ.Further, each domain can have a sequence provided herein. As describedbelow, the small letter “m” prefix indicates that the listed four aminoacids represent a motif of DNA contacting residues.

Nucleic acids of the invention include nucleic acids encoding theaforementioned polypeptides. A nucleic acid of the invention can beoperably regulated by a heterologous nucleic acid sequence, e.g., aninducible promoter (e.g., a steroid hormone regulated promoter, asmall-molecule regulated promoter, or an engineered inducible systemsuch as the tetracycline Tet-On and Tet-Off systems). In one embodiment,the promoter is inducible in a mammalian cell.

As described herein, the polypeptide can be produced in a cell and canregulate a gene in the cell, e.g., an endogenous gene, by binding to atarget site, e.g., a site that includes a subsite that the respectivezinc finger domain(s) recognizes. The cell can be mammalian cell.

The invention further includes a method of expressing a polypeptidedescribed herein, fused to a heterologous nucleic acid binding domain.The method includes introducing into a cell a nucleic acid encoding theaforementioned fusion protein.

In another aspect, the invention features an encapsulated composition.The composition includes an encapsulation layer composed of abiocompatible material and recombinant mammalian cells. The cellscontain a nucleic acid including a sequence encoding a chimeric zincfinger protein that regulates production of another nucleic acid in thecells, e.g., a heterologous nucleic acid or an endogenous nucleic acid.For example, the cells can regulate a gene that encodes a secretedpolypeptide or that regulates or participates in the production of asecreted factor, e.g., a secreted polypeptide. In one embodiment, thesecreted polypeptide is insulin, an insulin-like growth factor, VEGF-A,HGF, interferon, interleukin, or a fibroblast growth factor.

The encapsulation layer typically is permeable at least to proteinshaving a molecular weight of 10 kDa, e.g., proteins about 10, 20, 30,40, 50, or 70 kDa in molecular weight. The encapsulation layer can beimpermeable, e.g., to proteins larger than those molecular weights,e.g., larger than 100 kDa. Additional encapsulation layers may bepresent. The chimeric zinc finger protein can include one or morefeatures described herein.

The term “zinc finger protein” refers to any protein that includes azinc finger domain. A protein can include one or more polypeptidechains. Exemplary zinc finger proteins include two, three, four, five,six, or more zinc finger domains. Typically the protein is a singlechain. However, in some embodiment, the protein can include a pluralityof polypeptide chains For example, the protein can be a heterodimeric orhomodimeric protein.

The term “base contacting positions,” “DNA contacting positions,” or“nucleic acid contacting positions” refers to the four amino acidpositions of zinc finger domains that structurally correspond to thepositions of amino acids arginine 73, aspartic acid 75, glutamic acid76, and arginine 79 of ZIF268. (SEQ ID NO:129) Glu Arg Pro Tyr Ala CysPro Val Glu Ser Cys Asp Arg Arg Phe Ser 1               5                  10                  15 Arg Ser AspGlu Leu Thr Arg His Ile Arg Ile His Thr Gly Gln Lys            20                  25                  30 Pro Phe Gln CysArg Ile Cys Met Arg Asn Phe Ser Arg Ser Asp His         35                  40                  45 Leu Thr Thr His IleArg Thr His Thr Gly Glu Lys Pro Phe Ala Cys    50                  55                  60 Asp Ile Cys Gly Arg LysPhe Ala Arg Ser Asp Glu Arg Lys Arg His 65                  70                  75                  80 Thr Lys IleHis Leu Arg Gln Lys Asp                 85

These positions are also referred to as positions −1, 2, 3, and 6,respectively. To identify positions in a query sequence that correspondto the base contacting positions, the query sequence is aligned to thezinc finger domain of interest such that the cysteine and histidineresidues of the query sequence are aligned with those of finger 3 ofZif268. The ClustalW WWW Service at the European BioinformaticsInstitute (Thompson et al. (1994) Nucleic Acids Res. 22:4673-4680)provides one convenient method of aligning sequences.

Conservative amino acid substitutions refer to the interchangeability ofresidues having similar side chains. For example, a group of amino acidshaving aliphatic side chains is glycine, alanine, valine, leucine, andisoleucine; a group of amino acids having aliphatic-hydroxyl side chainsis serine and threonine; a group of amino acids having amide-containingside chains is asparagine and glutamine; a group of amino acids havingaromatic side chains is phenylalanine, tyrosine, and tryptophan; a groupof amino acids having basic side chains is lysine, arginine, andhistidine; a group of amino acids having acidic side chains is asparticacid and glutamic acid; and a group of amino acids havingsulfur-containing side chains is cysteine and methionine. Depending oncircumstances, amino acids within the same group may be interchangeable.Some additional conservative amino acids substitution groups are:valine-leucine-isoleucine; phenylalanine-tyrosine; lysine-arginine;alanine-valine; aspartic acid-glutamic acid; and asparagine-glutamine.

The term “heterologous polypeptide” refers either to a polypeptide witha non-naturally occurring sequence (e.g., a hybrid polypeptide) or apolypeptide with a sequence identical to a naturally occurringpolypeptide but present in a milieu in which it does not naturallyoccur. For example, the fusion of two naturally occurring polypeptidesthat are not fused together in Nature results in a heterologouspolypeptide in which one polypeptide is heterologous to the other.

The term “hybrid” refers to a non-naturally occurring polypeptide thatcomprises amino acid sequences derived from either (i) at least twodifferent naturally occurring sequences; (ii) at least one artificialsequence (i.e., a sequence that does not occur naturally) and at leastone naturally occurring sequence; or (iii) at least two artificialsequences (same or different). Examples of artificial sequences includemutants of a naturally occurring sequence and de novo designedsequences.

As used herein, the term “hybridizes under stringent conditions” refersto conditions for hybridization in 6× sodium chloride/sodium citrate(SSC) at 45° C., followed by two washes in 0.2×SSC, 0.1% SDS at 65° C.The invention also features nucleic acids that hybridize under stringentconditions to a nucleic acid described herein or to a nucleic acidencoding a polypeptide described herein.

The term “binding preference” refers to the discriminative property of apolypeptide for selecting one nucleic acid binding site relative toanother. For example, when the polypeptide is limiting in quantityrelative to two different nucleic acid binding sites, a greater amountof the polypeptide will bind the preferred site relative to the othersite in an in vivo or in vitro assay described herein.

As used herein, the “dissociation constant” refers to the equilibriumdissociation constant of a protein (e.g., a zinc finger protein) forbinding to a target site of interest. In the case of zinc finger proteinthat recognizes a target site between 9 and 18 basepairs, the binding isevaluated in the context of a 28-basepair double-stranded DNA. Thedissociation constant is determined by gel shift analysis using purifiedprotein that is bound in 20 mM Tris pH 7.7, 120 mM NaCl, 5 mM MgCl₂, 20μM ZnSO₄, 10% glycerol, 0.1% Nonidet P-40, 5 mM DTT, and 0.10 mg/mL BSA(bovine serum albumin) at room temperature. Additional details areprovided in Example 1 and Rebar and Pabo ((1994) Science 263:671-673).Dissociation constants can be less than 10⁻⁶, 10⁻⁷, 10⁻⁸, or 10⁻⁹ M.

One polypeptide (the “competing polypeptide”) can be said to “compete”with another (a polypeptide of interest) for a binding site, if, in anin vitro assay using probe molecules with the target site, the competingpolypeptide is present at a concentration no more than 10-fold greaterthan the concentration of the polypeptide of interest, the number ofprobe molecules bound by the polypeptide of interest is reduced at least25%. These experiments are done at about 50 fold the Kd of thepolypeptide of interest for the probe molecule.

A given zinc finger domain is said to “bind specifically” to a given3-base pair DNA site if a chimeric protein that includes fingers 1 and 2of Zif268 and the given zinc finger domain has an affinity of at least 5nM for a target site that includes both the given 3-base pair DNA siteand the 5-bp sequence, 5′-GGGCG-3′, that is recognized by fingers 1 and2 of Zif268. The terms “recognize” and “specifically bind” are usedinterchangeably and refer to the discrimination for a binding site by azinc finger domain in the above Zif268 fusion assay.

As used herein, “degenerate oligonucleotides” refers to both (a) apopulation of different oligonucleotides that each encode a particularamino acid sequence, and (b) a single species of oligonucleotide thatcan anneal to more than one sequence, e.g., an oligonucleotide with anunnatural nucleotide such as inosine.

An “isolated composition” refers to a composition that is removed fromat least 90% of at least one component of a cellular sample or reactionmixture from which the isolated composition can be obtained.Compositions produced artificially or naturally can be “compositions ofat least” a certain degree of purity if the species or population ofspecies of interests is at least 5, 10, 25, 50, 75, 80, 90, 92, 95, 98,or 99% pure on a weight-weight basis. Any protein or nucleic acidcomposition described herein can be provided in an isolated form.

The use of zinc finger domains is particularly advantageous. First, thezinc finger structure is capable of recognizing very diverse DNAsequences, but any particular zinc finger can have a high degree ofspecificity for a particular sequence. Second, the structure ofnaturally occurring zinc finger proteins is modular. For example, thezinc finger protein Zif268, also called “Egr-1,” is composed of a tandemarray of three zinc finger domains. Pavletich and Pabo describe thex-ray crystallographic structure of a fragment of the zinc fingerprotein Zif268. Pavletich and Pabo (1991) Science 252:809-817. In thisstructural model, the three Zif268 fingers are complexed with DNA. Eachfinger independently contacts 3-4 basepairs of the DNA recognition site.High affinity binding is achieved by the cooperative effect of havingmultiple zinc finger modules in the same polypeptide chain.

The present invention avails itself of all the zinc finger domainspresent in the human genome, or any other genome. This diverse samplingof sequence space occupied by the zinc finger domain structural fold mayhave the additional advantages inherent in eons of natural selection.Moreover, by utilizing domains from the host species, a zinc fingerprotein engineered for a gene therapy application by the methodsdescribed herein has a reduced likelihood of being regarded as foreignby the host immune response. It is also possible to use non-naturallyoccurring zinc finger domains, e.g., variants of human or mammalian zincfinger domains or completely artificial zinc finger domains.

The ability to select a DNA binding domain that recognizes a particularsequence permits the design of novel proteins that specifically regulatea target gene, such an endogenous cellular gene. In manyimplementations, the proteins have therapeutic or industrialapplications. Other applications are also possible.

This disclosure also includes a number of examples that demonstrate,using particular embodiments, that zinc finger proteins generally can beused as a therapeutic for treating cancer. The examples show that zincfinger proteins can function as powerful inhibitors of VEGF-Aexpression. Since VEGF-A contributes to angiogenesis in tumor tissues,zinc finger proteins that modulate (e.g., inhibit) VEGF-A can be used,e.g., to reduce angiogenesis in and near tumors.

All patents, patent applications, and references cited herein areincorporated by reference in their entirety. The following patentapplications: WO 01/60970 (Kim et al.); U.S. Published Applications2002-0061512, 2003-165997, and 2003-194727, and U.S. Ser. Nos.10/669,861, 60/431,892 and 60/477,459 are expressly incorporated byreference in their entirety for all purposes. The details of one or moreembodiments of the invention are set forth in the accompanying drawingsand the description below. Other features, objects, and advantages ofthe invention will be apparent from the description and drawings, andfrom the claims.

DESCRIPTION OF DRAWINGS

FIGS. 1A, 1B, and 1C list the nucleic acid sequence (SEQ ID NO: 120) ofan exemplary region of the human VEGF-A gene. The region includes thepromoter. The sequence is from GENBANK® entry AF095785.1. Thetranscriptional initiation site is at about nucleotide 2363. The startcodon is at about nucleotide 3401.

FIGS. 2A, 2B, 2C, 2D, 2E, and 2F list the nucleic acid sequence (SEQ IDNO:121) of an exemplary region of the human transforming protein (FGF4)gene. The region includes the promoter. The sequence is from GENBANK®entry J02986.1 and AP006345.2 (Homo sapiens genomic DNA, chromosome 11clone:RP11-186D19, complete sequence). The transcriptional initiationsite is at about nucleotide 3731. The start codon is at about nucleotide3959.

FIGS. 3A, 3B, 3C, 3D, and 3E list the nucleic acid sequence (SEQ IDNO:122) of an exemplary region of the human hepatocyte growth factor(HGF) gene. The region includes the promoter. The sequence is fromGENBANK® entry AC004960.1 for Homo sapiens PAC clone RP5-1098B1 from7q11.23-q21 The transcriptional initiation site is at about nucleotide4389. The start codon is at about nucleotide 4454.

FIG. 4 is a schematic of the VEGF-A promoter.

FIG. 5A provides schematics of exemplary nucleic acid constructs forexpressing zinc finger proteins with KRAB domains.

FIG. 5B provides a schematic of an exemplary luciferase reporterconstruct that contains the VEGF-A promoter.

DETAILED DESCRIPTION

Chimeric zinc finger proteins that include at least one zinc fingerdomain can be used to regulate the expression of genes within cells.Zinc finger protein can include two or more naturally-occurring zincfinger domains. In one set of examples, chimeric zinc finger proteinsare used to regulate the VEGF-A gene in a mammalian cell.

Chimeric zinc finger proteins can be obtained by a variety of methods.

In one embodiment, these proteins are designed to recognize a target DNAsite. Useful target sites include sites in a regulatory region of thetarget gene or within 1 kb or 500 bp of a regulatory region of a targetgene. For example, the target site can be within 1 kb or 500 bp of theTATA box or transcriptional start site of a gene. One method fordesigning a zinc finger protein includes parsing target sites into 3 or4 basepair sequences that can be recognized by an individual zinc fingerdomain. Then a nucleic acid is constructed which includes a sequencethat encodes a protein that has consecutive zinc finger domainscorresponding to the parsed elements. A plurality of different nucleicacids that encode candidate proteins is constructed and expressed in ahost cell. The expression of the target gene is evaluated to identifyone or more of the candidates that is able to regulate expression of thetarget gene.

In another embodiment, a chimeric zinc finger protein is selected from alibrary of zinc finger domains based on its phenotypic effect in a cell.For example, a nucleic acid library that encodes random chimeras of zincfinger domains is transformed into mammalian culture cells. Nucleicacids of the library are expressed in the cells. The cells are evaluatedfor a phenotype of interest, and cells in which the phenotype is alteredrelative to a control are isolated. The library nucleic acids in suchcells are recovered, and the zinc finger protein encoded by suchrecovered nucleic acids can be further characterized, utilized, ormodified.

Zinc Finger Domains

Zinc finger domains are small polypeptide domains of approximately 30amino acid residues in which there are four residues, either cysteine orhistidine, appropriately spaced such that they can coordinate a zinc ion(for reviews, see, e.g., Klug and Rhodes, (1987) Trends Biochem. Sci.12:464-469(1987); Evans and Hollenberg, (1988) Cell 52:1-3; Payre andVincent, (1988) FEBS Lett. 234:245-250; Miller et al., (1985) EMBO J.4:1609-1614; Berg, (1988) Proc. Natl. Acad. Sci. U.S.A. 85:99-102;Rosenfeld and Margalit, (1993) J. Biomol. Struct. Dyn.11:557-570).Hence, zinc finger domains can be categorized according to the identityof the residues that coordinate the zinc ion, e.g., as the Cys₂-His₂class, the Cys₂-Cys₂ class, the Cys₂-CysHis class, and so forth. Thezinc coordinating residues of Cys₂-His₂ zinc fingers are typicallyspaced as follows: C-X₂₋₅-C-X₃-X_(a)-X₅-ψ-X₂-H-X₃₋₅-H, (SEQ ID NO:123)

where ψ (psi) is a hydrophobic residue (Wolfe et al., (1999) Annu. Rev.Biophys. Biomol. Struct. 3:183-212), “X” represents any amino acid, thesubscript number indicates the number of amino acids, and a subscriptwith two hyphenated numbers indicates a typical range of interveningamino acids. In many zinc finger domains, the initial cysteine ispreceded by phenylalanine or tyrosine and then a non-cysteine aminoacid. Typically, the intervening amino acids fold to form ananti-parallel β-sheet that packs against an α-helix, although theanti-parallel β-sheets can be short, non-ideal, or non-existent. Thefold positions the zinc-coordinating side chains so they are in atetrahedral conformation appropriate for coordinating the zinc ion. Thebase contacting residues are in the loop region between the pair ofmetal chelating residues.

For convenience, the primary DNA contacting residues of a zinc fingerdomain are numbered: −1, 2, 3, and 6 based on the following example:(SEQ ID NO:124)                 −1 1 2 3 4  5 6C-X₂₋₅-C-X₃-X_(a)-X-R-X-D-E-X_(b)-X-R-H-X₃₋₅-H,

As noted in the example above, the DNA contacting residues are Arg (R),Asp (D), Glu (E), and Arg (R). The above motif can be abbreviated RDER.As used herein, such abbreviation is a shorthand that refers to aparticular polypeptide sequence from the second residue preceding thefirst cysteine (above, initial residue of SEQ ID NO:124) to the ultimatemetal-chelating histidine (ultimate residue of SEQ ID NO:124). In theabove motif and others, X_(a) is frequently aromatic, and X_(b) isfrequently hydrophobic. Where two different sequences have the samemotif, a number may be used to indicate each sequence (e.g., RDER1 orRDER2).

In certain contexts where made explicitly apparent, the four-letterabbreviation refers to the motif in general. In other words, the motifspecifies the amino acids at positions −1, 2, 3, and 6, while the otherpositions can be any amino acid, typically, but not necessarily, anon-cysteine amino acid. The small letter “m” before a motif can be usedto make explicit that the abbreviation is referring to a motif. Forexample, mRDER refers to a motif in which R appears at positions −1, Dat position 2, E at position 3, and R at position 6.

A zinc finger DNA-binding protein may consist of a tandem array of threeor more zinc finger domains.

The zinc finger domain (or “ZFD”) is one of the most common eukaryoticDNA-binding motifs, found in species from yeast to higher plants and tohumans. By one estimate, there are at least several thousand zinc fingerdomains in the human genome alone, possibly at least 4,500. Zinc fingerdomains can be identified in or isolated from zinc finger proteins.Non-limiting examples of zinc finger proteins include CF2-II; Kruppel;WT1; basonuclin; BCL-6/LAZ-3; erythroid Kruppel-like transcriptionfactor; transcription factors Sp1, Sp2, Sp3, and Sp4; transcriptionalrepressor YY1; EGR1/Krox24; EGR2/Krox20; EGR3/Pilot; EGR4/AT133; Evi-1;GLI1; GLI2; GLI3; HIV-EP1/ZNF40; HIV-EP2; KR1; Zfx; ZfY; and ZNF7.

An artificial transcription factor can include chimeras of availablezinc finger domain. In one embodiment, one or more of the zinc fingerdomains is naturally occurring. Many exemplary human zinc finger domainsare described in US 2002-0061512, US 2003-165997, and U.S. Ser. No.60/431,892. See also Table 6 below. The binding specificities of eachdomain, can be used to design a transcription factor with a particularspecificity. TABLE 6 Exemplary Zinc Finger Domains SEQ ID ZFD Amino AcidSequence NO: CSNR1 YKCKQCGKAFGCPSNLRRHGRTH 1 DSAR2YSCGICGKSFSDSSAKRRHCILH 2 DSCR YTCSDCGKAFRDKSCLNRHRRTH 3 QSHR2YKCGQCGKFYSQVSHLTRHQKIH 4 QSHT YKCEECGKAFRQSSHLTTHKIIH 5 QSNR3YECEKCGKAFNQSSNLTRHKKSH 6 QSNV2 YVCSKCGKAFTQSSNLTVHQKIH 7 QSSR1YKCPDCGKSFSQSSSLIRHQRTH 8 RDER1 YVCDVEGCTWKFARSDELNRHKKRH 9 RDHTFQCKTCQRKFSRSDHLKTHTRTH 10 RSHR YKCMECGKAFNRRSHLTRHQRIH 11 RSNRYICRKCGRGFSRKSNLIRHQRTH 12 VSNV YECDHCGKAFSVSSNLNVHRRIH 13 VSSRYTCKQCGKAFSVSSSLRRHETTH 14 VSTR YECNYCGKTFSVSSTLIRHQRIH 15 WSNRYRCEECGKAFRWPSNLTRHKRIH 16 QSHV YECDHCGKSFSQSSHLNVHKRTH 17 RDHR1FLCQYCAQRFGRKDHLTRHMKKS 18 DSNRa# YRCKYCDRSFSDSSNLQRHVRNIH 19#indicates that the domain is not a naturally occurring human domain.

Additional exemplary zinc finger domains include domains with thefollowing motifs:

mCSNR, mDSAR, mDSCR, mISNR, mQFNR, mQSHV, mQSNI, mQSNK, mQSNR, mQSNV,mQSSR, mQTHQ, mQTHR, mRDER, mRDHT, mRDKR, mRSHR, mRSNR, mVSNV, mVSSR,mVSTR, mWSNR, mDGNV, mDSNR, and mRDNQ.

It is also possible to use other types of DNA binding domains, e.g., atleast one domain other than a zinc finger domain. The invention utilizescollections of nucleic acid binding domains with differing bindingspecificities. A variety of protein structures are known to interactnucleic acids with high affinity and high specificity. For reviews ofstructural motifs which recognize double stranded DNA, see, e.g., Paboand Sauer (1992) Annu. Rev. Biochem. 61:1053-95; Patikoglou and Burley(1997) Annu. Rev. Biophys. Biomol. Struct. 26:289-325; Nelson (1995)Curr Opin GenetDev. 5:180-9). A few non-limiting examples of nucleicacid binding domains, other than zinc finger domains, include:homeodomains, helix-turn-helix domains, winged helix domains, andhelix-loop-helix domains.

Transcription Factor Features

In addition to a DNA-binding domain, a transcription factor mayoptionally include a regulatory domain, a nuclear localization signal,or other feature described herein.

Activation domains. Transcriptional activation domains that may be usedin the present invention include but are not limited to the Gal4activation domain from yeast and the VP16 domain from herpes simplexvirus. The ability of a domain to activate transcription can bevalidated by fusing the domain to a known DNA binding domain and thendetermining if a reporter gene operably linked to sites recognized bythe known DNA-binding domain is activated by the fusion protein.

An exemplary activation domain is the following domain from p65: (SEQ IDNO:73) YLPDTDDRHRIEEKRKRTYETFKSIMKKSPFSGPTDPRPPPRRIAVPSRSSASVPKPAPQPYPFTSSLSTINYDEFPTMVFPSGQISQASALAPAPPQVLPQAPAPAPAPAMVSALAQAPAPVPVLAPGPPQAVAPPAPKPTQAGEGTLSEALLQLQFDDEDLGALLGNSTDPAVFTDLASVDNSEFQQLLNQGIPVAPHTTEPMLMEYPEAITRLVTAQRPPDPAPAPLGAPGLPNGLLSGDEDFSSIA DMDFSALLSQ

The sequence of an exemplary Gal4 activation domain is as follows: (SEQID NO:74) NFNQSGNIADSSLSFTFTNSSNGPNLITTQTNSQALSQPIASSNVHDNFMNNEITASKIDDGNNSKPLSPGWTDQTAYNAFGITTGMFNTTTMDDVYNYLFDDEDTPPNPKKEISMAYPYDVPDYAS

In bacteria, activation domain function can be emulated by a domain thatrecruits a wild-type RNA polymerase alpha subunit C-terminal domain or amutant alpha subunit C-terminal domain, e.g., a C-terminal domain fusedto a protein interaction domain.

Repression domains. If desired, a repression domain instead of anactivation domain can be fused to the DNA binding domain. Examples ofeukaryotic repression domains include repression domains from Kid, UME6,ORANGE, groucho, and WRPW (see, e.g., Dawson et al., (1995) Mol. CellBiol. 15:6923-31). The ability of a domain to repress transcription canbe validated by fusing the domain to a known DNA binding domain and thendetermining if a reporter gene operably linked to sites recognized bythe known DNA-binding domain is repressed by the fusion protein.

A first exemplary repression domain is the “KRAB” domain from the Kidprotein (Witzgall R. et al. (1994) Proc, Natl. Acad. Sci. U.S.A.,91(10): 4514-8): (SEQ ID NO:75)VSVTFEDVAVLFTRDEWKKLDLSQRSLYREVMLENYSNLASMAGFLFTKP KVISLLQQGEDPW

A second exemplary repression domain is the KOX repression domain. Thisdomain includes the “KRAB” domain from the human Kox1 protein (Zincfinger protein 10; NCBI protein database AAH24182; GI:18848329), i.e.,amino acids 2-97 of Kox1: (SEQ ID NO:72)DAKSLTAWSRTLVTFKDVFVDFTREEWKLLDTAQQIVYRNVMLENYKNLVSLGYQLTKPDVILRLEKGEEPWLVEREIHQETHPDSETAFEIKSSV

A third exemplary repression domain is the following domain from UME6protein: (SEQ ID NO:119)NSASSSTKLDDDLGTAAAVLSNMRSSPYRTHDKPISNVNDMNNTNALGVPASRPHSSSFPSKGVLRPILLRIHNSEQQPIFESNNSTACI

The WRPW domain is still another example of a repression domain.

Still other chimeric transcription factors include neither an activationor repression domain. Rather, such transcription factors may altertranscription by displacing or otherwise competing with a boundendogenous transcription factor (e.g., an activator or repressor).

Other Functional Domains. Examples of other functional domains include ahistone modifying enzyme (e.g., a histone acetylase or deacetylase), aDNA modifying enzyme (e.g., a methylase), and so forth.

A protein transduction domain can be fused to the zinc finger protein.Protein transduction domains result in uptake of the transduction domainand attached polypeptide into cells. A “protein transduction domain” or“PTD” is an amino acid sequence that can cross a biological membrane,particularly a cell membrane. When attached to a heterologouspolypeptide, a PTD can enhance the translocation of the heterologouspolypeptide across a biological membrane. The PTD is typicallycovalently attached (e.g., by a peptide bond) to the heterologous DNAbinding domain. For example, the PTD and the heterologous DNA bindingdomain can be encoded by a single nucleic acid, e.g., in a common openreading frame or in one or more exons of a common gene. An exemplary PTDcan include between 10-30 amino acids and may form an amphipathic helix.Many PTD's are basic in character, e.g., include at least 4, 5, 6 or 8basic residues (e.g., arginine or lysine). A PTD may be able to enhancethe translocation of a polypeptide into a cell that lacks a cell wall ora cell from a particular species, e.g., a eukaryotic cell, e.g., avertebrate cell, e.g., a mammalian cell, such as a human, simian,murine, bovine, equine, feline, or ovine cell.

Typically a PTD is linked to a zinc finger protein by producing the DNAbinding domain of the zinc finger protein and the PTD as a singlepolypeptide chain, but other methods of for physically associating a PTDcan be used. For example, the PTD can be associated by a non-covalentinteraction (e.g., using biotin-avidin, coiled-coils, etc.) Moretypically, a PTD can be linked to a zinc finger protein, for example,using a flexible linker. Flexible linkers can include one or moreglycine residues to allow for free rotation. For example, the PTD can bespaced from a DNA binding domain of the transcription factor by at least10, 20, or 50 amino acids. A PTD can be located N- or C-terminalrelative to a DNA binding domain.

An zinc finger protein can also include a plurality of PTD's, e.g., aplurality of different PTD's or at least two copies of one PTD.

Exemplary PTD's include the following segments from the antennapediaprotein, the herpes simplex virus VP22 protein and HIV TAT protein.

Tat. The Tat protein from Human Immunodeficiency virus type I (HIV-1)has the remarkable capacity to enter cells when added exogenously(Frankel A. D. and Pabo C. O. (1988) Cell 55:1189-1193, Mann D. A andFrankel A. D. (1991) EMBO J. 10:1733-1739, Fawell et al. (1994) Proc.Natl. Acad. Sci. USA 91:664-668). The minimal Tat PTD includes residues47-57 of the human immunodeficiency virus Tat protein. This peptidesequence is referred to as “TAT” herein.

Antennapedia. The antennapedia homeodomain also includes a peptide thatis a PTD. Derossi et al. (1994) J. Bio. Chem. 269:10444-10450. Thispeptide, also referred to as “Penetratin.”,

VP22. The HSV VP22 protein also includes a PTD. This PTD is located atthe VP22 C-terminal 34 amino acid residues. See, e.g., Elliott andO'Hare (1997) Cell 88:223-234 and U.S. Pat. No. 6,184,038.

Another exemplary PTD is a poly-arginine sequence, e.g., a sequence thatincludes at least 4, 5, 6 or 8 arginine residues, e.g., between 5 and 10arginine residues.

Cell-specific PTD's. Some PTD's are specific for particular cell typesor states. One exemplary cell-specific PTD is the Hn1 synthetic peptidedescribed in U.S. Published Application 2002-0102265. Hn1 isinternalized by human head and neck squamous carcinoma cells and can beused to target an artificial transcription factor to a carcinoma, e.g.,a carcinoma of the head or neck. or closely related sequences. U.S.Published Application 2002-0102265 also describes a general method forusing phage display to identify other peptides and proteins which canfunction as cell specific PTD's. For additional information about PTD's,see also U.S. 2003-0082561; U.S. 2002-0102265; U.S. 2003-0040038;Schwarze et al. (1999) Science 285:1569-1572; Derossi et al. (1996) J.Biol. Chem. 271:18188; Hancock et al. (1991) EMBO J. 10:4033-4039; Busset al. (1988) Mol. Cell. Biol. 8:3960-3963; Derossi et al. (1998) Trendsin Cell Biology 8:84-87; Lindgren et al. (2000) Trends inPharmacological Sciences 21:99-103; Kilic et al. (2003) Stroke34:1304-10; Asoh et al. (2002) Proc Natl Acad Sci USA 99(26):17107-12;and Tanaka et al. (2003) J Immunol. 170(3):1291-8.

Design of Novel DNA-Binding Proteins

In one embodiment, a zinc finger protein is rationally designed bymixing and matching characterized zinc finger domains so that eachdomain recognizes one segment of the target site. Zinc finger domainscan be isolated and characterized, e.g., using the methods described inUS 2002-0061512 and 2003-165997. The modular structure of zinc fingerdomains facilitates their rearrangement to construct new DNA-bindingproteins. Zinc finger domains in the naturally-occurring Zif268 proteinare positioned in tandem along the DNA double helix. Each domainindependently recognizes a different 3-4 basepair DNA segment.

A Database of Zinc Finger Domains. The one-hybrid selection systemdescribed above can be utilized to identify one or more zinc fingerdomains for each possible 3- or 4-basepair binding site or arepresentative number of such binding sites. The results of this processcan be accumulated as a series of associations between a zinc fingerdomain and its preferred 3- or 4-basepair binding site or sites.Examples of such associations are provided in US 2002-0061512 and2003-165997.

The results can also be stored in a machine as a database, e.g., arelational database, spreadsheet, or text file. Each record of such adatabase associates a representation of a zinc finger domain and astring indicating the sequence of the one or more preferred bindingsites of the domain. The database record can include an indication ofthe relative affinity of the zinc finger domains that bind each site. Insome implementations, the database record can also include informationthat indicates the physical location of the nucleic acid encoding theparticular zinc finger domain. Such a physical location can be, forexample, a particular well of a microtitre plate stored in a freezer.

The database can be configured so that it can be queried or filtered,e.g., using a SQL operating environment, a scripting language (such asPERL or a MICROSOFT EXCEL® macro), or a programming language. Such adatabase would enable a user to identify one or more zinc finger domainsthat recognizes a particular 3- or 4-basepair binding site. Database andother information such as can be stored on a database server can also beconfigured to communicate with each device using commands and othersignals that are interpretable by the device. The computer-based aspectsof the system can be implemented in digital electronic circuitry, or incomputer hardware, firmware, software, or in combinations thereof. Anapparatus of the invention, e.g., the database server, can beimplemented in a computer program product tangibly embodied in amachine-readable storage device for execution by a programmableprocessor; and method actions can be performed by a programmableprocessor executing a program of instructions to perform functions ofthe invention by operating on input data and generating output. Onenon-limiting example of an execution environment includes computersrunning WINDOWS XP® or WINDOWS NT 4.0® (Microsoft, Redmond Wash.),LINUX™, or other operating systems.

The zinc finger domains can also be tested in the context of multipledifferent fusion proteins to verify their specificity. Moreover,particular binding sites for which a paucity of domains is available canbe the target of additional selection screens. Libraries for suchselections can be prepared by mutagenizing a zinc finger domain thatbinds a similar yet distinct site. A complete matrix of zinc fingerdomains for each possible binding site is not essential, as the domainscan be staggered relative to the target binding site in order to bestutilize the domains available. Such staggering can be accomplished bothby parsing the binding site in the most useful 3 or 4 basepair bindingsites, and also by varying the linker length between zinc fingerdomains. In order to incorporate both selectivity and high affinity intothe design polypeptide, zinc finger domains that have high specificityfor a desired site can be flanked by other domains that bind with higheraffinity, but lesser specificity. The in vivo screening methodsdescribed in US 2002-0061512 and 2003-165997 can be used to test the invivo function, affinity, and specificity of an artificially assembledzinc finger protein and derivatives thereof. Likewise, these methods canbe used to optimize such assembled proteins, e.g., by creating librariesof varied linker composition, varied zinc finger domain modules, variedzinc finger domain compositions, and so forth.

Parsing a target site. The target 9-bp or longer DNA sequence is parsedinto 3- or 4-bp segments. Zinc finger domains are identified (e.g., froma database described above) that recognize each parsed 3- or 4-bpsegment. Longer target sequences, e.g., 20 bp to 500 bp sequences, arealso suitable targets as 9 bp, 12 bp, and 15 bp subsequences can beidentified within them. In particular, subsequences amenable for parsinginto sites well represented in the database can serve as initial designtargets.

A scoring regime can be used to estimate the probability that aparticular chimeric zinc finger protein would recognize the target sitein the cell. The scores can be a function of each component finger'saffinity for its preferred subsites, its specificity, and its success inpreviously designed proteins.

Computer Programs. Computer systems and software can be used to access amachine-readable database described above, parse a target site, andoutput one or more chimeric zinc finger protein designs.

The techniques may be implemented in programs executing on programmablemachines such as mobile or stationary computers, and similar devicesthat each include a processor, a storage medium readable by theprocessor, and one or more output devices. Each program may beimplemented in a high level procedural or object oriented programminglanguage to communicate with a machine system. Some merely illustrativeexamples of computer languages include C, C++, JAVA™, Fortran, andVISUAL BASIC™.

Each such program may be stored on a storage medium or device, e.g.,compact disc read only memory (CD-ROM), hard disk, magnetic diskette, orsimilar medium or device, that is readable by a general or specialpurpose programmable machine for configuring and operating the machinewhen the storage medium or device is read by the computer to perform theprocedures described in this document. The system may also beimplemented as a machine-readable storage medium, configured with aprogram, where the storage medium so configured causes a machine tooperate in a specific and predefined manner.

The computer system can be connected to an internal or external network.For example, the computer system can receive requests from a remotelylocated client system, e.g., using HTTP, HTTPS, or XML protocols. Therequests can be an identifier for a known target gene or a stringrepresenting the sequence of a target nucleic acid. In the former case,the computer system can access a sequence database such as GENBANK® toretrieve the nucleic acid sequence of regulatory regions of the targetgene. The sequence of the regulatory region or the directly-receivedtarget nucleic acid sequence is then parsed into subsites, and chimericzinc finger proteins are designed, e.g., as described above.

The system can communicate the results to the remotely located client.Alternatively, the system can control a robot to physically retrievenucleic acid encoding the chimeric zinc finger proteins. In thisimplementation, a library of nucleic acids encoding chimeric zinc fingerproteins is constructed and stored, e.g., as frozen purified DNA orfrozen bacterial strains harboring the nucleic acids. The robot respondsto signals from the computer system by accessing specified addresses ofthe library. The retrieved nucleic acids can then be processed, packagedand delivered to the client. Alternatively, the retrieved nucleic acidscan be introduced into cells and assayed. The computer system can thencommunicate the results of the assay to the client across the network.

Constructing a Protein from Selected Modules. Once a chimericpolypeptide sequence containing multiple zinc finger domains isdesigned, a nucleic acid sequence encoding the designed polypeptidesequence can be synthesized. Methods for constructing synthetic genesare routine in the art. Such methods include gene construction fromcustom synthesized oligonucleotides, PCR mediated cloning, andmega-primer PCR. In one example, nucleic acids encoding selected zincfinger domains are serially ligated to form a nucleic acid encoding achimeric polypeptide. Additional sequences can be joined to the nucleicacid encoding the designed polypeptide sequence. The additional sequencecan itself provide regulatory functions or can encode an amino acidsequence with a desired function.

Profiling Regulatory Properties of a Chimeric Zinc Finger Protein

A chimeric zinc finger protein can be characterized to determine itsability to regulate one or more endogenous genes in a cell, e.g., amammalian cell. Nucleic acid encoding the chimeric zinc finger proteinis first fused to a repression or activation domain, and then introducedinto a cell of interest. After appropriate incubation and induction ofexpression of the coding nucleic acid, mRNA is harvested from the celland analyzed using a nucleic acid microarray.

Nucleic acid microarrays can be fabricated by a variety of methods,e.g., photolithographic methods (see, e.g., U.S. Pat. No. 5,510,270),mechanical methods (e.g., directed-flow methods as described in U.S.Pat. No. 5,384,261), and pin based methods (e.g., as described in U.S.Pat. No. 5,288,514). The array is synthesized with a unique captureprobe at each address, each capture probe being appropriate to detect anucleic acid for a particular expressed gene.

The mRNA can be isolated by routine methods, e.g., including DNasetreatment to remove genomic DNA and hybridization to an oligo-dT coupledsolid substrate (e.g., as described in Current Protocols in MolecularBiology, John Wiley & Sons, N.Y). The substrate is washed, and the mRNAis eluted. The isolated mRNA is then reversed transcribed and optionallyamplified, e.g., by rtPCR, e.g., as described in (U.S. Pat. No.4,683,202). The nucleic acid can be labeled during amplification orreverse transcription, e.g., by the incorporation of a labelednucleotide. Examples of preferred labels include fluorescent labels,e.g., red-fluorescent dye Cy5 (Amersham) or green-fluorescent dye Cy3(Amersham). Alternatively, the nucleic acid can be labeled with biotin,and detected after hybridization with labeled streptavidin, e.g.,streptavidin-phycoerythrin (Molecular Probes).

The labeled nucleic acid is then contacted to the array. In addition, acontrol nucleic acid or a reference nucleic acid can be contacted to thesame array. The control nucleic acid or reference nucleic acid can belabeled with a label other than the sample nucleic acid, e.g., one witha different emission maximum. Labeled nucleic acids are contacted to anarray under hybridization conditions. The array is washed, and thenimaged to detect fluorescence at each address of the array.

A general scheme for producing and evaluating profiles is includesdetecting hybridization at each address of the array. The extent ofhybridization at an address is represented by a numerical value andstored, e.g., in a vector, a one-dimensional matrix, or one-dimensionalarray. The vector x has a value for each address of the array. Forexample, a numerical value for the extent of hybridization at aparticular address is stored in variable x_(a). The numerical value canbe adjusted, e.g., for local background levels, sample amount, and othervariations. Nucleic acid is also prepared from a reference sample andhybridized to the same or a different array. The vector y is constructidentically to vector x. The sample expression profile and the referenceprofile can be compared, e.g., using a mathematical equation that is afunction of the two vectors. The comparison can be evaluated as a scalarvalue, e.g., a score representing similarity of the two profiles. Eitheror both vectors can be transformed by a matrix in order to add weightingvalues to different genes detected by the array.

The expression data can be stored in a database, e.g., a relationaldatabase such as a SQL database (e.g., Oracle or Sybase databaseenvironments). The database can have multiple tables. For example, rawexpression data can be stored in one table, wherein each columncorresponds to a gene being assayed, e.g., an address or an array, andeach row corresponds to a sample. A separate table can store identifiersand sample information, e.g., the batch number of the array used, date,and other quality control information.

Genes that are similarly regulated can be identified by clusteringexpression data to identify coregulated genes. Such cluster may beindicative of a set of genes coordinately regulated by the chimeric zincfinger protein. Genes can be clustered using hierarchical clustering(see, e.g., Sokal and Michener (1958) Univ. Kans. Sci. Bull. 38:1409),Bayesian clustering, k-means clustering, and self-organizing maps (see,Tamayo et al. (1999) Proc. Natl. Acad. Sci. USA 96:2907).

The similarity of a sample expression profile to a reference expressionprofile (e.g., a control cell) can also be determined, e.g., bycomparing the log of the expression level of the sample to the log ofthe predictor or reference expression value and adjusting the comparisonby the weighting factor for all genes of predictive value in theprofile.

Additional Features for Designed Transcription Factors

Peptide Linkers. DNA binding domains can be connected by a variety oflinkers. The utility and design of linkers are well known in the art. Aparticularly useful linker is a peptide linker that is encoded bynucleic acid. Thus, one can construct a synthetic gene that encodes afirst DNA binding domain, the peptide linker, and a second DNA bindingdomain. This design can be repeated in order to construct large,synthetic, multi-domain DNA binding proteins. PCT WO 99/45132 and Kimand Pabo ((1998) Proc. Natl. Acad. Sci. USA 95:2812-7) describe thedesign of peptide linkers suitable for joining zinc finger domains.

Additional peptide linkers are available that form random coil,α-helical or β-pleated tertiary structures. Polypeptides that formsuitable flexible linkers are well known in the art (see, e.g., Robinsonand Sauer (1998) Proc Natl Acad Sci USA. 95:5929-34). Flexible linkerstypically include glycine, because this amino acid, which lacks a sidechain, is unique in its rotational freedom. Serine or threonine can beinterspersed in the linker to increase hydrophilicity. In additional,amino acids capable of interacting with the phosphate backbone of DNAcan be utilized in order to increase binding affinity. Judicious use ofsuch amino acids allows for balancing increases in affinity with loss ofsequence specificity. If a rigid extension is desirable as a linker,α-helical linkers, such as the helical linker described in Pantoliano etal. (1991) Biochem. 30:10117-10125, can be used. Linkers can also bedesigned by computer modeling (see, e.g., U.S. Pat. No. 4,946,778).Software for molecular modeling is commercially available (e.g., fromMolecular Simulations, Inc., San Diego, Calif.). The linker isoptionally optimized, e.g., to reduce antigenicity and/or to increasestability, using standard mutagenesis techniques and appropriatebiophysical tests as practiced in the art of protein engineering, andfunctional assays as described herein.

For implementations utilizing zinc finger domains, the peptide thatoccurs naturally between zinc fingers can be used as a linker to joinfingers together. A typical such naturally occurring linker is:Thr-Gly-(Glu or Gln)-(Lys or Arg)-Pro-(Tyr or Phe) (SEQ ID NO:125).

Dimerization Domains. An alternative method of linking DNA bindingdomains is the use of dimerization domains, especiallyheterodimerization domains (see, e.g., Pomerantz et al (1998)Biochemistry 37:965-970). In this implementation, DNA binding domainsare present in separate polypeptide chains. For example, a firstpolypeptide encodes DNA binding domain A, linker, and domain B, while asecond polypeptide encodes domain C, linker, and domain D. An artisancan select a dimerization domain from the many well-characterizeddimerization domains. Domains that favor heterodimerization can be usedif homodimers are not desired. A particularly adaptable dimerizationdomain is the coiled-coil motif, e.g., a dimeric parallel oranti-parallel coiled-coil. Coiled-coil sequences that preferentiallyform heterodimers are also available (Lumb and Kim, (1995) Biochemistry34:8642-8648). Another species of dimerization domain is one in whichdimerization is triggered by a small molecule or by a signaling event.For example, a dimeric form of FK506 can be used to dimerize two FK506binding protein (FKBP) domains. Such dimerization domains can beutilized to provide additional levels of regulation.

Functional Assays and Uses

Zinc finger proteins can be evaluated using cell-free assays andcellular assays. Examples of cell-free assays include assays in which atleast partially purified protein is evaluated for a biochemicalproperty, e.g., DNA binding in vitro. Examples of useful in vitro assaysinclude electrophoretic mobility shift assays (EMSA), DNA footprinting,DNA methylation protection assays, surface plasmon resonance,fluorescence polarization, and fluorescence resonance energy transfer(FRET). Binding and other functional properties can be assayed incellular assays or in vivo (e.g., in an organism).

For example, domains can be selected to bind to a target site, e.g., toa promoter site of a gene that modulates cell proliferation. By modularassembly, a protein can be designed that includes (1) the selecteddomains that respectively bind to subsites spanning the target promotersite, and (2) a transcriptional regulatory domain, e.g., an activationdomain or a repression domain. In an example in which the proteinregulates a gene that modulates cell proliferation and the protein isintended to counteract cell proliferation, the appropriatetranscriptional regulatory domain can be chosen depending on whether thegene increases cell proliferation (e.g., a repression domain isselected) or decreases cell proliferation (e.g., an activation domain isselected). In another example, a library encoding random combinations ofzinc finger domains is screened to identify a chimeric zinc fingerprotein that alters a phenotype.

A nucleic acid sequence encoding a chimeric zinc finger protein can becloned into an expression vector, e.g., an inducible expression vectoras described in Kang and Kim, (2000) J Biol Chem 275:8742. The inducibleexpression vector can include an inducible promoter or regulatorysequence. Non-limiting examples of inducible promoters includesteroid-hormone responsive promoters (e.g., ecdysone-responsive,estrogen-responsive, and glutacorticoid-responsive promoters), thetetracyclin “Tet-On” and “Tet-Off” systems, and metal-responsivepromoters. The construct can be transfected into tissue culture cells orinto embryonic stem cells to generate a transgenic organism as a modelsubject. The efficacy of the chimeric zinc finger protein can bedetermined by inducing expression of the protein and assaying cellproliferation of the tissue culture cell or assaying for developmentalchanges and/or tumor growth in a transgenic animal model. In addition,the level of expression of the gene being targeted can be assayed byroutine methods to detect mRNA, e.g., RT-PCR or Northern blots. A morecomplete diagnostic includes purifying mRNA from cells expressing andnot expressing the chimeric zinc finger protein. The two pools of mRNAare used to probe a microarray containing probes to a large collectionof genes, e.g., a collection of genes relevant to the condition ofinterest (e.g., cancer) or a collection of genes identified in theorganism's genome. Such an assay is particularly valuable fordetermining the specificity of the chimeric zinc finger protein. If theprotein binds with high affinity but little specificity, it may causepleiotropic and undesirable effects by affecting expression of genes inaddition to the contemplated target. Such effects are revealed by aglobal analysis of transcripts.

In addition, the chimeric zinc finger protein can be produced in asubject cell or subject organism in order to regulate an endogenousgene. The chimeric zinc finger protein is configured, as describedabove, to bind to a region of the endogenous gene and to provide atranscriptional activation or repression function. As described in Kangand Kim (supra), the expression of a nucleic acid encoding the chimericzinc finger protein can be operably linked to a regulatable promoter(e.g., an inducible or suppressible promoter). By modulating theconcentration of an agent that can regulate the promoter, e.g., aninducer for the promoter, the expression of the endogenous gene can beregulated in a concentration dependent manner.

The binding site preference of a zinc finger protein can be verified bya biochemical assay such as EMSA, DNase footprinting, surface plasmonresonance, SELEX, or column binding. The substrate for binding can be,e.g., a synthetic oligonucleotide encompassing the target site or arestriction fragment. The assay can also include non-specific DNA as acompetitor, or specific DNA sequences as a competitor. Specificcompetitor DNAs can include the recognition site for DNA binding withone, two, or three nucleotide mutations. Thus, a biochemical assay canbe used to measure not only the affinity of a domain for a given site,but also its affinity to the site relative to other sites. Rebar andPabo, (1994) Science 263:671-673 describe a method of obtaining apparentK_(d) constants for zinc finger domains from EMSA. Exemplary zinc fingerproteins have at least 2, 5, 10, 50, 100, or 500 fold preference for aparticular recognition site relative to a related site with one, two, orthree nucleotide mutations.

A protein or nucleic acid described herein can also be evaluated, e.g.,in vitro or in vivo for a biological activity, e.g., ability to modulatea endothelial cell or to modulate angiogenesis.

Endothelial cell proliferation. A protein or nucleic acid can be testedfor endothelial proliferation inhibiting activity using a biologicalactivity assay such as the bovine capillary endothelial cellproliferation assay, the chick CAM assay, the mouse corneal assay, andevaluating the effect of the protein or nucleic acid being tested onimplanted tumors. The chick CAM assay is described, e.g., by O'Reilly,et al. in “Angiogenic Regulation of Metastatic Growth” Cell, vol. 79(2), Oct. 21, 1994, pp. 315-328. Briefly, three-day old chicken embryoswith intact yolks are separated from the egg and placed in a petri dish.After three days of incubation a methylcellulose disc containing theprotein to be tested is applied to the CAM of individual embryos. After48 hours of incubation, the embryos and CAMs are observed to determinewhether endothelial growth has been inhibited. The mouse corneal assayinvolves implanting a growth factor-containing pellet, along withanother pellet containing the suspected endothelial growth inhibitor, inthe cornea of a mouse and observing the pattern of capillaries that areelaborated in the cornea.

Angiogenesis. Angiogenesis may be assayed, e.g., using various humanendothelial cell systems, such as umbilical vein, coronary artery, ordermal cells. Suitable assays include Alamar Blue based assays(available from Biosource International) to measure proliferation;migration assays using fluorescent molecules, such as the use of BectonDickinson Falcon HTS FluoroBlock cell culture inserts to measuremigration of cells through membranes in presence or absence ofangiogenesis enhancer or suppressors; and tubule formation assays basedon the formation of tubular structures by endothelial cells on Matrigel™(Becton Dickinson).

Cell adhesion. Cell adhesion assays measure adhesion of cells topurified adhesion proteins or adhesion of cells to each other, inpresence or absence of the protein or nucleic acid being tested.Cell-protein adhesion assays measure the ability of agents to modulatethe adhesion of cells to purified proteins. For example, recombinantproteins are produced, diluted to 2.5 g1 mL in PBS, and used to coat thewells of a microtiter plate. The wells used for negative control are notcoated. Coated wells are then washed, blocked with 1% BSA, and washedagain. Compounds are diluted to 2 times final test concentration andadded to the blocked, coated wells. Cells are then added to the wells,and the unbound cells are washed off. Retained cells are labeleddirectly on the plate by adding a membrane-permeable fluorescent dye,such as calcein-AM, and the signal is quantified in a fluorescentmicroplate reader.

Cell-cell adhesion assays can be used to measure the ability of theprotein or nucleic acid being tested to modulate binding of cells toeach other. These assays can use cells that naturally or recombinantlyexpress an adhesion protein of choice. In an exemplary assay, cellsexpressing the cell adhesion protein are plated in wells of a multiwellplate together with other cells (either more of the same cell type, oranother type of cell to which the cells adhere). The cells that canadhere are labeled with a membrane-permeable fluorescent dye, such asBCECF, and allowed to adhere to the monolayers in the presence of theprotein or nucleic acid being tested. Unbound cells are washed off, andbound cells are detected using a fluorescence plate reader.High-throughput cell adhesion assays have also been described. See,e.g., Falsey J R et al., Bioconjug Chem. May-June 2001;12(3):346-53.

Tubulogenesis. Tubulogenesis assays can be used to monitor the abilityof cultured cells, generally endothelial cells, to form tubularstructures on a matrix substrate, which generally simulates theenvironment of the extracellular matrix. Exemplary substrates includeMatrigel™ (Becton Dickinson), an extract of basement membrane proteinscontaining laminin, collagen IV, and heparin sulfate proteoglycan, whichis liquid at 4° C. and forms a solid gel at 37° C. Other suitablematrices comprise extracellular components such as collagen,fibronectin, and/or fibrin. Cells are stimulated with a pro-angiogenicstimulant, and their ability to form tubules is detected by imaging.Tubules can generally be detected after an overnight incubation withstimuli, but longer or shorter time frames may also be used. Tubeformation assays are well known in the art (e.g., Jones M K et al.,1999, Nature Medicine 5:1418-1423). These assays have traditionallyinvolved stimulation with serum or with the growth factors FGF or VEGF.In one embodiment, the assay is performed with cells cultured in serumfree medium. In one embodiment, the assay is performed in the presenceof one or more pro-angiogenic agents, e.g, inflammatory angiogenicfactors such as TNF-α, or FGF, VEGF, phorbol myristate acetate (PMA),TNF-alpha, ephrin, etc.

Cell Migration. An exemplary assay for endothelial cell migration is thehuman microvascular endothelial (HMVEC) migration assay. See, e.g.,Tolsma et al. (1993) J. Cell Biol 122, 497-511. Migration assays areknown in the art (e.g., Paik J H et al., 2001, J Biol Chem276:11830-11837). In one example, cultured endothelial cells are seededonto a matrix-coated porous lamina, with pore sizes generally smallerthan typical cell size. The lamina is typically a membrane, such as thetranswell polycarbonate membrane (Corning Costar Corporation, Cambridge,Mass.), and is generally part of an upper chamber that is in fluidcontact with a lower chamber containing pro-angiogenic stimuli.Migration is generally assayed after an overnight incubation withstimuli, but longer or shorter time frames may also be used. Migrationis assessed as the number of cells that crossed the lamina, and may bedetected by staining cells with hemotoxylin solution (VWR Scientific.),or by any other method for determining cell number. In another exemplaryset up, cells are fluorescently labeled and migration is detected usingfluorescent readings, for instance using the Falcon HTS FluoroBlok(Becton Dickinson). While some migration is observed in the absence ofstimulus, migration is greatly increased in response to pro-angiogenicfactors. The assay can be used to test the effect of the protein ornucleic acid being tested on endothelial cell migration.

Sprouting assay. An exemplary sprouting assay is a three-dimensional invitro angiogenesis assay that uses a cell-number defined spheroidaggregation of endothelial cells (“spheroid”), embedded in a collagengel-based matrix. The spheroid can serve as a starting point for thesprouting of capillary-like structures by invasion into theextracellular matrix (termed “cell sprouting”) and the subsequentformation of complex anastomosing networks (Korff and Augustin, 1999, JCell Sci 112:3249-58). In an exemplary experimental set-up, spheroidsare prepared by pipetting 400 human umbilical vein endothelial cellsinto individual wells of a nonadhesive 96-well plates to allow overnightspheroidal aggregation (Korff and Augustin: J Cell Biol 143: 1341-52,1998). Spheroids are harvested and seeded in 900 μl of methocel-collagensolution and pipetted into individual wells of a 24 well plate to allowcollagen gel polymerization. Test agents are added after 30 min bypipetting 100 μl of 10-fold concentrated working dilution of the testsubstances on top of the gel. Plates are incubated at 37° C. for 24 h.Dishes are fixed at the end of the experimental incubation period byaddition of paraformaldehyde. Sprouting intensity of endothelial cellscan be quantitated by an automated image analysis system to determinethe cumulative sprout length per spheroid.

Other exemplary assays include: Ferrara and Henzel (1989) Nature380:439-443; Gospodarowicz et al. (1989) Proc. Natl. Acad. Sci. USA, 86:7311-7315; and Claffey et al. (1995) Biochim. Biophys. Acta. 1246:1-9.;Leung et al. (1989) Science 246:1306-1309; Rastinejad et al. (1989) Cell56:345-355; and U.S. Pat. No. 5,840,693. The ability of a composition tomodulate ischemia can be evaluated, e.g., using a rat hindlimb ischemiamodel (see, e.g., Takeshita, S. et al., Circulation (1998) 98: 1261-63.

Targets for Gene Regulation

The target gene can be any gene, e.g., a chromosomal gene or aheterologous gene (e.g., a transgene). The target gene can be selected,e.g., if it is useful to regulate (e.g., increase or decrease) activityof the target gene. For example, a gene required by a pathogen can berepressed, a gene required for cancerous growth can be repressed, a genepoorly expressed or encoding an unstable protein can be activated andoverexpressed, a gene that confers stress resistance can be activated,and so forth.

Examples of specific target genes include genes that encode: cellsurface proteins (e.g., glycosylated surface proteins),cancer-associated proteins, cytokines, chemokines, peptide hormones,neurotransmitters, cell surface receptors (e.g., cell surface receptorkinases, seven transmembrane receptors, virus receptors andco-receptors, extracellular matrix binding proteins, cell-bindingproteins, antigens of pathogens (e.g., bacterial antigens, malarialantigens, and so forth). Additional protein targets include enzymes suchas enolases, cytochrome P450s, acyltransferases, methylases, TIM barrelenzymes, isomerases, acyl transferases, and so forth.

Still more examples include: integrins, cell attachment molecules or“CAMs” such as cadherins, selections, N-CAM, E-CAM, U-CAM, I-CAM and soforth); proteases (e.g., subtilisin, trypsin, chymotrypsin; aplasminogen activator, such as urokinase or human tissue-typeplasminogen activator); bombesin; factor IX, thrombin; CD-4;platelet-derived growth factor; insulin-like growth factor-I and -II;nerve growth factor; fibroblast growth factor (e.g., aFGF and bFGF);epidermal growth factor (EGF); VEGF (e.g., VEGF-A); transforming growthfactor (TGF, e.g., TGF-α and TGF-β; insulin-like growth factor bindingproteins; erythropoietin; thrombopoietin; mucins; human serum albumin;growth hormone (e.g., human growth hormone); proinsulin, insulin A-chaininsulin B-chain; parathyroid hormone; thyroid stimulating hormone;thyroxine; follicle stimulating hormone; calcitonin; atrial natriureticpeptides A, B or C; leutinizing hormone; glucagon; factor VIII;hemopoietic growth factor; tumor necrosis factor (e.g., TNF-α andTNF-β); enkephalinase; Mullerian-inhibiting substance;gonadotropin-associated peptide; tissue factor protein; inhibin;activin; vascular endothelial growth factor; receptors for hormones orgrowth factors; rheumatoid factors; osteoinductive factors; aninterferon, e.g., interferon-α,β,γ; colony stimulating factors (CSFs),e.g., M-CSF, GM-CSF, and G-CSF; interleukins (ILs), e.g., IL-1, IL-2,IL-3, IL-4, etc.; decay accelerating factor; and immunoglobulins. Insome embodiments, the targetgene encodes a protein or other factor(e.g., an RNA) that is associated with a disease, e.g., cancer, aninfectious disease, inflammation, or a cardiovascular disease.

In one embodiment, the gene is a human disease gene. For example, thegene can include a mutation that encodes a defective or impaired enzymeor the gene may have a defect in a regulatory sequence (e.g., atranscriptional, translational, or splicing regulatory sequence). A zincfinger protein can be obtained that increases expression of the gene.

For example, zinc finger proteins can be designed that interact with aFGF gene, e.g., to a binding site in the sequence listed in FIG. 2A-F,or with a hepatocyte growth factor (HGF) gene, e.g., to a binding sitein the sequence listed in FIG. 3A-E. For example, the proteins mayinteract with a promoter region of these genes.

A chimeric zinc finger protein for regulating any gene can be designedto interact with one or more target sites. For example, the target sitescan be located in a coding or non-coding region of the gene. In oneembodiment, the target site is located in a regulatory region, e.g., atranscriptional regulatory region such as the promoter. In oneembodiment, the target site is located within 700, 500, 300, 200, 50,20, 10, 5, or 3 basepairs of the transcription start site, a Dnasehypersensitive site, or a transcription factor binding site. In anembodiment in which the target gene is VEGF-A, the binding site candiffer from (e.g., not overlap with) a site in Table 2 or 3 of WO02/46412. In another embodiment, the binding site does overlap with sucha site.

Gene and Cell-Based Therapeutics

DNA molecules that encode a chimeric zinc finger protein can be insertedinto a variety of DNA constructs and vectors for the purposes of genetherapy. As used herein, a “vector” is a nucleic acid molecule competentto transport another nucleic acid molecule to which it has beencovalently linked. Vectors include plasmids, cosmids, artificialchromosomes, viral elements, and RNA vectors (e.g., based on RNA virusgenomes). The vector can be competent to replicate in a host cell or tointegrate into a host DNA. Viral vectors include, e.g., replicationdefective retroviruses, adenoviruses and adeno-associated viruses.

A gene therapy vector is a vector designed for administration to asubject, e.g., a mammal, such that a cell of the subject is able toexpress a therapeutic gene contained in the vector. The gene therapyvector can contain regulatory elements, e.g., a 5′ regulatory element,an enhancer, a promoter, a 5′ untranslated region, a signal sequence, a3′ untranslated region, a polyadenylation site, and a 3′ regulatoryregion. For example, the 5′ regulatory element, enhancer or promoter canregulate transcription of the DNA encoding the therapeutic polypeptide.The regulation can be tissue specific. For example, the regulation canrestrict transcription of the desired gene to brain cells, e.g.,cortical neurons or glial cells; hematopoietic cells; or endothelialcells. Alternatively, regulatory elements can be included that respondto an exogenous drug, e.g., a steroid, tetracycline, or the like. Thus,the level and timing of expression of the therapeutic zinc fingerprotein (e.g., a polypeptide that regulates VEGF) can be controlled.

Gene therapy vectors can be prepared for delivery as naked nucleic acid,as a component of a virus, or of an inactivated virus, or as thecontents of a liposome or other delivery vehicle. See, e.g., US2003-0143266 and 2002-0150626. In one embodiment, the nucleic acid isformulated in a lipid-protein-sugar matrix to form microparticles.,e.g., having a diameter between 50 nm to 10 micrometers. The particlesmay be prepared using any known lipid (e.g.,dipalmitoylphosphatidylcholine, DPPC), protein (e.g., albumin), or sugar(e.g., lactose).

The gene therapy vectors can be delivered using a viral system.Exemplary viral vectors include vectors from retroviruses, e.g., Moloneyretrovirus, adenoviruses, adeno-associated viruses, and lentiviruses,e.g., Herpes simplex viruses (HSV). HSV, for example, is potentiallyuseful for infecting nervous system cells. See, e.g., US 2003-0147854,2002-0090716, 2003-0039636, 2002-0068362, and 2003-0104626. The genedelivery agent, e.g., a viral vector, can be produced from recombinantcells which produce the gene delivery system.

A gene therapy vector can be administered to a subject, for example, byintravenous injection, by local administration (see U.S. Pat. No.5,328,470) or by stereotactic injection (see e.g., Chen et al. (1994)Proc. Natl. Acad. Sci. USA 91:3054-3057). The gene therapy agent can befurther formulated, for example, to delay or prolong the release of theagent by means of a slow release matrix. One method of providing arecombinant zinc finger protein, is by inserting a gene therapy vectorinto bone marrow cells harvested from a subject. The cells are infected,for example, with a retroviral gene therapy vector, and grown inculture. Meanwhile, the subject is irradiated to deplete the subject ofbone marrow cells. The bone marrow of the subject is then replenishedwith the infected culture cells. The subject is monitored for recoveryand for production of the therapeutic polypeptide.

Cell based-therapeutic methods include introducing a nucleic acid thatencodes the chimeric zinc finger protein operably linked to a promoterinto a cell in culture. The chimeric zinc finger protein can be selectedto regulate an endogenous gene in the culture cell or to produce adesired phenotype in the cultured cell. Further, it is also possible tomodify cells, e.g., stem cells, using nucleic acid recombination, e.g.,to insert a transgene, e.g., a transgene encoding a chimeric zinc fingerprotein that regulates an endogenous gene. The modified stem cell can beadministered to a subject. Methods for cultivating stem cells in vitroare described, e.g., in US Application 2002-0081724. In some examples,the stem cells can be induced to differentiate in the subject andexpress the transgene. For example, the stem cells can be differentiatedinto liver, adipose, or skeletal muscle cells. The stem cells can bederived from a lineage that produces cells of the desired tissue type,e.g., liver, adipose, or skeletal muscle cells.

In another embodiment, recombinant cells that express or can express achimeric zinc finger protein, e.g., as described herein, can be used forreplacement therapy in a subject. For example, a nucleic acid encodingthe chimeric zinc finger protein operably linked to a promoter (e.g., aninducible promoter, e.g., a steroid hormone receptor-regulated promoter)is introduced into a human or nonhuman, e.g., mammalian, e.g., porcinerecombinant cell. The cell is cultivated and encapsulated in abiocompatible material, such as poly-lysine alginate, and subsequentlyimplanted into the subject. See, e.g., Lanza (1996) Nat. Biotechnol.14:1107; Joki et al. (2001) Nat. Biotechnol. 19:35; and U.S. Pat. No.5,876,742. Other examples of biocompatible polymers for encapsulatingcells include sodium alginate, barium alginate or sodium cellulosesulfate. Useful polymers enable proteins (e.g., proteins less than 70,20, or 10 kDa) to diffuse across them. Ultra-pure materials can improvethe viability of encapsulated cells and reduce immunological reactions.Encapsulated cells, e.g., cells that include an artificial transcriptionfactor and can produce a diffusible factor can be used as a therapy in asubject to provide the diffusible factor to the subject.

One exemplary method for encapsulating cells and tissues involves theuse of coatings formed of a non-fibrogenic alginate, a gelatinoussubstance that can be derived from certain kinds of kelp. For example,the cells are suspended in a viscous, liquid alginate, which is thenatomized by any of a number of different arrangements into droplets ofsuitable size to encapsulate the cells. Once the droplets come intocontact with a gelling solution, such as calcium chloride or bariumchloride, a single layer alginate coating is created around the cells.

Examples of this approach for creating single layer alginate coatingsusing an electrostatic coating process are shown in U.S. Pat. No.4,789,550, U.S. Pat. No. 4,956,128, U.S. Pat. No. 5,429,821, U.S. Pat.No. 5,639,467, U.S. Pat. No. 5,656,468 and U.S. Pat. No. 5,693,514. Anexample for creating a single layer alginate coating using an air knifeprocess is shown in U.S. Pat. No. 5,521,079. A pressurized process forcoating droplets is described in U.S. Pat. No. 5,260,002 and U.S. Pat.No. 5,462,866. Other examples for creating a single layer alginatecoating using a spinning disk arrangement are shown in U.S. Pat. No.5,643,594 and U.S. Pat. No. 6,001,387. Examples for creating a singlelayer alginate coating using a piezoelectric nozzle are shown in U.S.Pat. No. 5,286,496, U.S. Pat. No. 5,648,099 and U.S. Pat. No. 6,033,888.U.S. Pat. No. 5,470,731 and U.S. Pat. No. 5,531,997 describe a doublelayer coating for tissue that comprises a first layer of a gel-ableorganic polymer and a cationic polymer and a second water-soluble,semi-permeable layer chemically bonded to the first layer. U.S. Pat. No.6,020,200 describes a dual layer coating having a stabilized outer layerformed of a cross-linked polymer matrix. U.S. Pat. No. 5,227,298 (Weberat al.) describes a double walled alginate coating.

Encapsulated cells can be implanted by surgery (e.g., laproscopic orconventional surgical methods) or by injection. Cells can be introducedinto any appropriate body site including the liver, spleen, thymus,testes, brain, pancreas, lungs, kidneys, peritoneal cavity, subcutaneoustissues, fat pads and other locations. See, e.g., J. Rozga et al.,Intraabdominal Organ Transplantation 2000; R. G. Landes Co., USA, 1994:129.

In implementations where the chimeric zinc finger protein regulates anendogenous gene that encodes a secreted protein, production of thesecreted polypeptide can be regulated in the subject by administering anagent (e.g., a steroid hormone) to the subject. In another embodiment,production of the zinc finger protein can be placed under control of anendogenous signal, e.g., a signal indicating reduced level of thesecreted protein. Thus, an artificial feedback loop can be used. Forexample, the signal can be mediated by a transcription factor that isregulated by level of the secreted protein itself.

For additional methods for encapsulating cells, see, for example: U.S.Pat. No. 4,391,909; US 2002-0022016; Lohr et al., (2002) CancerChemother Pharmacol, 49: S21-S24; Hobbs et al., (2001) Journal ofInvestigative Medicine, vol. 49, no.6, 49(6):572-5; Zimmermann et al.(2001) Ann N Y Acad Sci. 2001; Moashebi et al; Tissue Engineering, 2001,vol. 7, 5, 525-534); Orive et al., (2002) Trends in Biotechnology, vol.20, 382-7; Lim and Sun (1980) Science 210: 908-910; Reed et al. 2001.Nature Biotech. 19:29-34; Domish et al., (2001) “Standards andguidelines for Biopolymers in Tissue-Engineered Medical Products: ASTMAlginate and Chitosan Standard Guides.” Ann N Y Acad Sci. 2001;944:388-97.

In still another embodiment, the recombinant cells that express or canexpress a chimeric zinc finger protein are cultivated in vitro. Aprotein produced by the recombinant cells can be recovered (e.g.,purified) from the cells or from media surrounding the cells. In anotherexample the recombinant cells are used as feeder cells.

Pharmaceutical Compositions

In another aspect, the invention provides compositions, e.g.,pharmaceutically acceptable compositions, which include an zinc fingerprotein or a nucleic acid encoding it, e.g., a molecule describedherein, formulated together with a pharmaceutically acceptable carrier.

As used herein, “pharmaceutically acceptable carrier” includes any andall solvents, dispersion media, coatings, antibacterial and antifungalagents, isotonic and absorption delaying agents, and the like that arephysiologically compatible. Preferably, the carrier is suitable forintravenous, intramuscular, subcutaneous, parenteral, spinal orepidermal administration (e.g., by injection or infusion). Depending onthe route of administration, the active compound may be coated in amaterial to protect the compound from the action of acids and othernatural conditions that may inactivate the compound.

A “pharmaceutically acceptable salt” refers to a salt that retains thedesired biological activity of the parent compound and does not impartany undesired toxicological effects (see e.g., Berge, S. M., et al.(1977) J. Pharm. Sci. 66:1-19). Examples of such salts include acidaddition salts and base addition salts. Acid addition salts includethose derived from nontoxic inorganic acids, such as hydrochloric,nitric, phosphoric, sulfuric, hydrobromic, hydroiodic, phosphorous andthe like, as well as from nontoxic organic acids such as aliphatic mono-and dicarboxylic acids, phenyl-substituted alkanoic acids, hydroxyalkanoic acids, aromatic acids, aliphatic and aromatic sulfonic acidsand the like. Base addition salts include those derived from alkalineearth metals, such as sodium, potassium, magnesium, calcium and thelike, as well as from nontoxic organic amines, such asN,N′-dibenzylethylenediamine, N-methylglucamine, chloroprocaine,choline, diethanolamine, ethylenediamine, procaine and the like.

The compositions of this invention may be in a variety of forms. Theseinclude, for example, liquid, semi-solid and solid dosage forms, such asliquid solutions (e.g., injectable and infusible solutions), dispersionsor suspensions, tablets, pills, powders, liposomes and suppositories.The preferred form depends on the intended mode of administration andtherapeutic application. Exemplary compositions are in the form ofinjectable or infusible solutions. One mode of administration isparenteral (e.g., intravenous, subcutaneous, intraperitoneal,intramuscular). In one embodiment, the composition that includes thezinc finger protein or a nucleic acid encoding it is administered byintravenous infusion or injection. In another embodiment, thecomposition that includes the zinc finger protein or a nucleic acidencoding it is administered by intramuscular or subcutaneous injection.

The phrases “parenteral administration” and “administered parenterally”as used herein means modes of administration other than enteral andtopical administration, usually by injection, and includes, withoutlimitation, intravenous, intramuscular, intraarterial, intrathecal,intracapsular, intraorbital, intracardiac, intradermal, intraperitoneal,transtracheal, subcutaneous, subcuticular, intraarticular, subcapsular,subarachnoid, intraspinal, epidural and intrasternal injection andinfusion.

Pharmaceutical compositions typically must be sterile and stable underthe conditions of manufacture and storage. Endotoxin levels in thepreparation can be tested using the Limulus amebocyte lysate assay(e.g., using the kit from Bio Whittaker lot # 7L3790, sensitivity 0.125EU/mL) according to the USP 24/NF 19 methods. Sterility ofpharmaceutical compositions can be determined using thioglycollatemedium according to the USP 24/NF 19 methods. For example, thepreparation is used to inoculate the thioglycollate medium and incubatedat 35° C. for 14 or more days. The medium is inspected periodically todetect growth of a microorganism.

The composition can be formulated as a solution, microemulsion,dispersion, liposome, or other ordered structure suitable to high drugconcentration. Sterile injectable solutions can be prepared byincorporating the active compound (i.e., the zinc finger protein or anucleic acid encoding it) in the required amount in an appropriatesolvent with one or a combination of ingredients enumerated above, asrequired, followed by filtered sterilization. Generally, dispersions areprepared by incorporating the active compound into a sterile vehiclethat contains a basic dispersion medium and the required otheringredients from those enumerated above. In the case of sterile powdersfor the preparation of sterile injectable solutions, the preferredmethods of preparation are vacuum drying and freeze-drying that yields apowder of the active ingredient plus any additional desired ingredientfrom a previously sterile-filtered solution thereof. The proper fluidityof a solution can be maintained, for example, by the use of a coatingsuch as lecithin, by the maintenance of the required particle size inthe case of dispersion and by the use of surfactants. Prolongedabsorption of injectable compositions can be brought about by includingin the composition an agent that delays absorption, for example,monostearate salts and gelatin.

A composition that includes a zinc finger protein or a nucleic acidencoding it can be administered by a variety of methods known in theart. For many applications, the route/mode of administration isintravenous injection or infusion. For example, for therapeuticapplications, the composition that includes a zinc finger protein or anucleic acid encoding it can be administered by intravenous infusion ata rate of less than 30, 20, 10, 5, or 1 mg/min to reach a dose of about1 to 100 mg/m² or 7 to 25 mg/m². The route and/or mode of administrationwill vary depending upon the desired results. In certain embodiments,the active compound may be prepared with a carrier that will protect thecompound against rapid release, such as a controlled releaseformulation, including implants, and microencapsulated delivery systems.Biodegradable, biocompatible polymers can be used, such as ethylenevinyl acetate, polyanhydrides, polyglycolic acid, collagen,polyorthoesters, and polylactic acid. Many methods for the preparationof such formulations are patented or generally known. See, e.g.,Sustained and Controlled Release Drug Delivery Systems, J. R. Robinson,ed., Marcel Dekker, Inc., New York, 1978.

In certain embodiments, the composition may be orally administered, forexample, with an inert diluent or an assimilable edible carrier. Thecompound (and other ingredients, if desired) also may be enclosed in ahard or soft shell gelatin capsule, compressed into tablets, orincorporated directly into the subject's diet. For oral therapeuticadministration, the compounds may be incorporated with excipients andused in the form of ingestible tablets, buccal tablets, troches,capsules, elixirs, suspensions, syrups, wafers, and the like. Toadminister a compound described herein by other than parenteraladministration, it may be necessary to coat the compound with, orco-administer the compound with, a material to prevent its inactivation.

Pharmaceutical compositions can be administered with medical devicesknown in the art. For example, in a preferred embodiment, apharmaceutical composition described herein can be administered with aneedle-less hypodermic injection device, such as the devices disclosedin U.S. Pat. Nos. 5,399,163, 5,383,851, 5,312,335, 5,064,413, 4,941,880,4,790,824, or 4,596,556. Examples of well-known implants and modulesuseful in the invention include: U.S. Pat. No. 4,487,603, whichdiscloses an implantable micro-infusion pump for dispensing medicationat a controlled rate; U.S. Pat. No. 4,486,194, which discloses atherapeutic device for administering medicaments through the skin; U.S.Pat. No. 4,447,233, which discloses a medication infusion pump fordelivering medication at a precise infusion rate; U.S. Pat. No.4,447,224, which discloses a variable flow implantable infusionapparatus for continuous drug delivery; U.S. Pat. No. 4,439,196, whichdiscloses an osmotic drug delivery system having multi-chambercompartments; and U.S. Pat. No. 4,475,196, which discloses an osmoticdrug delivery system. Of course, many other such implants, deliverysystems, and modules also are known.

In certain embodiments, the compounds described herein can be formulatedto ensure proper distribution in vivo. For example, the blood-brainbarrier (BBB) excludes many highly hydrophilic compounds. To ensure thata therapeutic can cross the BBB (if desired), it can be formulated, forexample, in a liposome. For methods of manufacturing liposomes, see,e.g., U.S. Pat. Nos. 4,522,811; 5,374,548; and 5,399,331. The liposomesmay include one or more moieties which are selectively transported intospecific cells or organs, thus enhance targeted drug delivery (see,e.g., V. V. Ranade (1989) J. Clin. Pharmacol. 29:685).

Dosage regimens are adjusted to provide the optimum desired response(e.g., a therapeutic response). For example, a single bolus may beadministered, several divided doses may be administered over time or thedose may be proportionally reduced or increased as indicated by theexigencies of the therapeutic situation. It is especially advantageousto formulate parenteral compositions in dosage unit form for ease ofadministration and uniformity of dosage. Dosage unit form as used hereinrefers to physically discrete units suited as unitary dosages for thesubjects to be treated; each unit contains a predetermined quantity ofactive compound calculated to produce the desired therapeutic effect inassociation with the required pharmaceutical carrier. The specificationfor the dosage unit forms can be dictated by and directly dependent on(a) the unique characteristics of the active compound and the particulartherapeutic effect to be achieved, and (b) the limitations inherent incompounding such an active compound for the treatment of sensitivity inindividuals.

An exemplary, non-limiting range for a therapeutically orprophylactically effective amount of a composition described herein is0.1-20 mg/kg, more preferably 1-10 mg/kg. The composition can beadministered by intravenous infusion at a rate of less than 30, 20, 10,5, or 1 mg/min to reach a dose of about 1 to 100 mg/m² or about 5 to 30mg/m². It is to be noted that dosage values may vary with the type andseverity of the condition to be alleviated. It is to be furtherunderstood that for any particular subject, specific dosage regimensshould be adjusted over time according to the individual need and theprofessional judgment of the person administering or supervising theadministration of the compositions, and that dosage ranges set forthherein are exemplary only and are not intended to limit.

A pharmaceutical composition may include a “therapeutically effectiveamount” or a “prophylactically effective amount” of a zinc fingerprotein or a nucleic acid encoding it, e.g., a protein or nucleic aciddescribed herein. A “therapeutically effective amount” refers to anamount effective, at dosages and for periods of time necessary, toachieve the desired therapeutic result. A therapeutically effectiveamount of the composition may vary according to factors such as thedisease state, age, sex, and weight of the individual, and the abilityof the protein to elicit a desired response in the individual. Atherapeutically effective amount is also one in which any toxic ordetrimental effects of the composition are outweighed by thetherapeutically beneficial effects. A “therapeutically effective dosage”preferably inhibits a measurable parameter, e.g., inflammation or tumorgrowth rate by at least about 20%, more preferably by at least about40%, even more preferably by at least about 60%, and still morepreferably by at least about 80% relative to untreated subjects. Theability of a compound to inhibit a measurable parameter, e.g., cancer,can be evaluated in an animal model system predictive of efficacy inhuman tumors. Alternatively, this property of a composition can beevaluated by examining the ability of the compound to inhibit, suchinhibition in vitro by assays known to the skilled practitioner.

A “prophylactically effective amount” refers to an amount effective, atdosages and for periods of time necessary, to achieve the desiredprophylactic result. Typically, since a prophylactic dose is used insubjects prior to or at an earlier stage of disease, theprophylactically effective amount will be less than the therapeuticallyeffective amount.

Also within the scope of the invention are kits including the zincfinger protein or a nucleic acid that encodes it and instructions foruse, e.g., treatment, prophylactic, or diagnostic use. In an embodimentin which the zinc finger protein regulates the VEGF-A gene, theinstructions for therapeutic applications include suggested dosagesand/or modes of administration in a patient with a cancer or neoplasticdisorder, or angiogenesis related disorder (e.g., certain inflammatorydisorders). The kit can further contain a least one additional reagent,such as a diagnostic or therapeutic agent, e.g., a diagnostic ortherapeutic agent as described herein, and/or one or more additionalzinc finger proteins or nucleic acids, formulated as appropriate, in oneor more separate pharmaceutical preparations.

Treatments

Zinc finger proteins that can regulate an endogenous gene, particularlyproteins that can regulate the VEGF-A gene, have therapeutic andprophylactic utilities. For example, these proteins or nucleic acidencoding them can be administered to cells in culture, e.g. in vitro orex vivo, or in a subject, e.g., in vivo, to treat, prevent, and/ordiagnose a variety of disorders, such as cancers, particularlymetastatic cancers, an inflammatory disorder, and other disordersassociated with increased angiogenesis.

As used herein, the term “treat” or “treatment” is defined as theapplication or administration of an agent which enables a zinc fingerprotein to enter a cell and regulate gene expression, to a subject,e.g., a patient, or application or administration of the agent to anisolated tissue or cell, e.g., cell line, from a subject, e.g., apatient, who has a disorder (e.g., a disorder as described herein), asymptom of a disorder or a predisposition toward a disorder, with thepurpose to cure, heal, alleviate, relieve, alter, remedy, ameliorate,improve or affect the disorder, the symptoms of the disorder or thepredisposition toward the disorder.

In one embodiment, “treating a cell” or “treating a tissue” refers to areduction in at least one activity of a cell, e.g., VEGF-A production,angiogenesis stimulation, proliferation, or other activity of a cell,e.g., a hyperproliferative cell or cell in or near a tissue, e.g., atumor. Such reduction can include a reduction, e.g., a statisticallysignificant reduction, in the activity of a cell or tissue (e.g.,metastatic tissue) or the number of the cell or size of the tissue, theamount or degree of blood supply to the tissue. An example of areduction in activity is a reduction in migration of the cell (e.g.,migration through an extracellular matrix), a reduction in blood vesselformatin, or a reduction in cell differentiation. Another example is anactivity that, directly or indirectly, reduces inflammation or anindicator of inflammation.

As used herein, an amount of a zinc finger protein or a nucleic acidencoding it effective to treat a disorder, or a “therapeuticallyeffective amount” refers to an amount of the protein or nucleic acidwhich is effective, upon single or multiple dose administration to asubject, in treating a cell.

As used herein, an amount of an zinc finger protein or a nucleic acidencoding it effective to prevent a disorder, or a “a prophylacticallyeffective amount” of the protein or nucleic acid refers to an amount ofthe protein or the nucleic acid encoding it, which is effective, uponsingle- or multiple-dose administration to the subject, in preventing ordelaying the occurrence of the onset or recurrence of a disorder, e.g.,a cancer, angiogenesis-based disorder, or inflammatory disorder.

As used herein, the term “subject” is intended to include human andnon-human animals. Exemplary subjects include a human patient having adisorder characterized by abnormal cell proliferation or celldifferentiation. The term “non-human animals” includes all non-humanvertebrates, e.g., non-mammals (such as chickens, amphibians, reptiles)and mammals, such as non-human primates, sheep, dog, cow, pig, etc.

In one embodiment, the subject is a human subject. In one embodiment,the composition of a zinc finger protein or a nucleic acid encoding itcan be administered to a non-human mammal (e.g., a primate, pig ormouse) for veterinary purposes or as an animal model of human disease.Regarding the latter, such animal models may be useful for evaluatingthe therapeutic efficacy of the composition (e.g., testing of dosagesand time courses of administration).

In one embodiment, the invention provides a method of treating aneoplastic disorder. The method can include the steps of contacting acell of a subject with an zinc finger protein or a nucleic acid encodingit, e.g., a zinc finger protein that regulates VEGF-A or a nucleic acidencoding it, e.g., as described herein, in an amount sufficient to treator prevent the neoplastic disorder. For example, the disorder can becaused by a cancerous cell, a tumor cell or a metastatic cell. Thesubject method can be used on cells in culture, e.g. in vitro or exvivo. For example, cancerous or metastatic cells (e.g., renal,urothelial, colon, rectal, lung, breast, endometrial, ovarian,prostatic, or liver cancerous or metastatic cells) can be cultured invitro in culture medium and the contacting step can be effected byadding the zinc finger protein or a nucleic acid encoding it to theculture medium. The method can be performed on cells (e.g., cancerous ormetastatic cells) present in a subject (e.g., a human subject), as partof an in vivo (e.g., therapeutic or prophylactic) protocol. For in vivoembodiments, the contacting step is effected in a subject and includesadministering the zinc finger protein or a nucleic acid encoding it tothe subject under conditions effective to permit regulation of theVEGF-A gene in cells of the subject.

The method can be used to treat a cancer. As used herein, the terms“cancer”, “hyperproliferative”, “malignant”, and “neoplastic” are usedinterchangeably, and refer to those cells an abnormal state or conditioncharacterized by rapid proliferation or neoplasm. The terms include alltypes of cancerous growths or oncogenic processes, metastatic tissues ormalignantly transformed cells, tissues, or organs, irrespective ofhistopathologic type or stage of invasiveness. “Pathologichyperproliferative” cells occur in disease states characterized bymalignant tumor growth.

The common medical meaning of the term “neoplasia” refers to “new cellgrowth” that results as a loss of responsiveness to normal growthcontrols, e.g. to neoplastic cell growth. A “hyperplasia” refers tocells undergoing an abnormally high rate of growth. However, as usedherein, the terms neoplasia and hyperplasia can be used interchangeably,as their context will reveal, referring generally to cells experiencingabnormal cell growth rates. Neoplasias and hyperplasias include“tumors,” which may be benign, premalignant or malignant.

Examples of cancerous disorders include, but are not limited to, solidtumors, soft tissue tumors, and metastatic lesions. Examples of solidtumors include malignancies, e.g., sarcomas, adenocarcinomas, andcarcinomas, of the various organ systems, such as those affecting lung,breast, lymphoid, gastrointestinal (e.g., colon), and genitourinarytract (e.g., renal, urothelial cells), pharynx, prostate, ovary as wellas adenocarcinomas which include malignancies such as most coloncancers, rectal cancer, renal-cell carcinoma, liver cancer, non-smallcell carcinoma of the lung, cancer of the small intestine and so forth.Metastatic lesions of the aforementioned cancers also can be treated orprevented using a method or composition described herein.

The subject method can be useful in treating malignancies of the variousorgan systems, such as those affecting lung, breast, lymphoid,gastrointestinal (e.g., colon), and genitourinary tract, prostate,ovary, pharynx, as well as adenocarcinomas which include malignanciessuch as most colon cancers, renal-cell carcinoma, prostate cancer and/ortesticular tumors, non-small cell carcinoma of the lung, cancer of thesmall intestine and cancer of the esophagus. The term “carcinoma” isrecognized by those skilled in the art and refers to malignancies ofepithelial or endocrine tissues including respiratory system carcinomas,gastrointestinal system carcinomas, genitourinary system carcinomas,testicular carcinomas, breast carcinomas, prostatic carcinomas,endocrine system carcinomas, and melanomas. Exemplary carcinomas includechoriocarcinomas and those forming from tissue of the cervix, lung,prostate, breast, endometrium, head and neck, colon and ovary. The termalso includes carcinosarcomas, e.g., which include malignant tumorscomposed of carcinomatous and sarcomatous tissues. An “adenocarcinoma”refers to a carcinoma derived from glandular tissue or in which thetumor cells form recognizable glandular structures. The term “sarcoma”is recognized by those skilled in the art and refers to malignant tumorsof mesenchymal derivation.

The subject method also can be used to inhibit the proliferation ofhyperplastic/neoplastic cells of hematopoietic origin shown to expressVEGF-A.

Methods of administering zinc finger proteins or nucleic acids aredescribed in “Pharmaceutical Compositions”. Suitable dosages of themolecules used will depend on the age and weight of the subject and theparticular drug used.

A zinc finger protein or a nucleic acid encoding it can be coupled tolabel, e.g., for imaging in a subject after it is delivered to asubject. Suitable labels include MRI-detectable labels or radiolabels.

A zinc finger protein or a nucleic acid encoding it described herein canbe administered alone or in combination with one or more of the existingmodalities for treating cancers, including, but not limited to: surgery;radiation therapy, and chemotherapy.

A zinc finger protein or a nucleic acid encoding it, particularly onethat can regulate (e.g., reducing expression of) the VEGF-A gene, can beadministered alone or in combination with one or more of the existingmodalities for treating an inflammatory disease or disorder. Exemplaryinflammatory diseases or disorders include: acute and chronic immune andautoimmune pathologies, such as, but not limited to, rheumatoidarthritis (RA), juvenile chronic arthritis (JCA), psoriasis, graftversus host disease (GVHD), scleroderma, diabetes mellitus, allergy;asthma, acute or chronic immune disease associated with an allogenictransplantation, such as, but not limited to, renal transplantation,cardiac transplantation, bone marrow transplantation, livertransplantation, pancreatic transplantation, small intestinetransplantation, lung transplantation and skin transplantation; chronicinflammatory pathologies such as, but not limited to, sarcoidosis,chronic inflammatory bowel disease, ulcerative colitis, and Crohn'spathology or disease; vascular inflammatory pathologies, such as, butnot limited to, disseminated intravascular coagulation, atherosclerosis,Kawasaki's pathology and vasculitis syndromes, such as, but not limitedto, polyarteritis nodosa, Wegener's granulomatosis, Henoch-Schonleinpurpura, giant cell arthritis and microscopic vasculitis of the kidneys;chronic active hepatitis; Sjogren's syndrome; psoriatic arthritis;enteropathic arthritis; reactive arthritis and arthritis associated withinflammatory bowel disease; and uveitis.

Inflammatory bowel diseases (IBD) include generally chronic, relapsingintestinal inflammation. IBD refers to two distinct disorders, Crohn'sdisease and ulcerative colitis (UC). The clinical symptoms of IBDinclude intermittent rectal bleeding, crampy abdominal pain, weight lossand diarrhea. A clinical index can also be used to monitor IBD such asthe Clinical Activity Index for Ulcerative Colitis. See also, e.g.,Walmsley et al. Gut. 1998 July; 43(1):29-32 and Jowett et al. (2003)Scand J Gastroenterol. 38(2):164-71.

A zinc finger protein or a nucleic acid encoding it can be used to treator prevent one of the foregoing diseases or disorders. For example, theprotein can be administered (locally or systemically) in an amounteffective to ameliorate at least one symptom of the respective diseaseor disorder. The protein may also ameliorate inflammation, e.g., anindicator of inflammation, e.g., such as local temperature, swelling(e.g., as measured), redness, local or systemic white blood cell count,presence or absence of neutrophils, cytokine levels, and so forth. It ispossible to evaluate a subject, e.g., prior, during, or afteradministration of the protein, for one or more of indicators ofinflammation, e.g., an aforementioned indicator.

A zinc finger protein or a nucleic acid encoding it, particularly onethat can regulate (e.g., increase expression of) the VEGF-A gene, can beadministered alone or in combination with one or more of the existingmodalities for treating a wound, e.g., to promote wound healing. Forexample, generally, activation of VEGF-A can increase formation of newblood vessels and capillaries. The protein or nucleic acid can also beused for ameliorating surgery, burn, traumas, ulcers, bone fractures,and other disorders that require increased angiogenesis.

A zinc finger protein or a nucleic acid encoding it, particularly onethat can regulate (e.g., increase expression of) the VEGF-A gene, can beadministered alone or in combination with one or more of the existingmodalities for treating a cardiovascular disorder, e.g., e.g., ischemicheart disease, peripheral artery disease, or coronary artery disease.

The present invention will be described in more detail through thefollowing practical examples. However, it should be noted that theseexamples are not intended to limit the scope of the present invention.

EXAMPLE 1 Gel Shift Assays

This example provides a method of evaluating the DNA binding propertiesof zinc finger proteins in vitro. Zinc finger proteins were expressed inE. coli, purified, and used in gel shift assays. The DNA segmentsencoding zinc finger proteins were inserted into pGEX-4T2 (PharmaciaBiotech). These constructs were expressed in E. coli strain BL21 toproduce fusion proteins that include the zinc finger proteins connectedto GST (Glutathione-S-transferase). The fusion proteins were purifiedusing glutathione affinity chromatography (Pharmacia Biotech,Piscataway, N.J.) and then digested with thrombin. Thrombin cleaves thelinker sequence between the GST moiety and zinc finger proteins.

Various amounts of a zinc finger protein were incubated with aradioactively labeled probe DNA for one hour at room temperature in 20mM Tris pH 7.7, 120 mM NaCl, 5 mM MgCl₂, 20 μM ZnSO₄, 10% glycerol, 0.1%Nonidet P-40, 5 mM DTT, and 0.10 mg/mL BSA (bovine serum albumin), andthen the reaction mixtures were subjected to gel electrophoresis.Distribution of the probe in the gel was quantitated by PHOSPHORIMAGER™analysis (Molecular Dynamics). Dissociation constants (K_(d)) weredetermined as described (Rebar and Pabo (1994) Science 263:671-673).

We have previously found that zinc finger proteins that function in anin vivo yeast assay also have biochemical activity. In general, when azinc finger protein, e.g., having three zinc finger domains, binds a DNAsequence with a dissociation constant lower than 1 nM, it allows cellgrowth in the one-hybrid yeast cell assay described in US 2002-0061512,whereas when a zinc finger protein binds a DNA sequence with adissociation constant higher than 1 nM, it does not allow cell growth inthat assay. Zinc finger proteins that bind with a dissociation constantof greater than 1 nM but less than 50 nM can also be useful. Forexample, additional fingers can be added to those zinc fingers toproduce tighter or more specific binders.

The in vitro assay can also be adapted to evaluate binding by anindividual zinc finger domain to a particular three or four basepairsite. In one implementation, the individual zinc finger domain isevaluated in the context of fingers 1 and 2 of Zif268 and a target sitethat includes (i) basepairs recognized by fingers 1 and 2 and (ii) theparticular three or four basepair site.

EXAMPLE 2 Construction of Individual Three-Fingered Proteins

This example provides a method for constructing a nucleic acid encodinga chimeric three-fingered protein. The vector P3 (Toolgen, Inc.) wasused to express chimeric zinc finger proteins in mammalian cells. P3 wasconstructed by modification of the pcDNA3 vector (Invitrogen, San DiegoCalif.). A synthetic oligonucleotide duplex having compatible overhangswas ligated into the pcDNA3 vector digested with HindIII and XhoI. Theduplex contains nucleic acid that encodes the hemagglutinin (HA) tag anda nuclear localization signal. The duplex also includes BamHI, EcoRI andNotI and BglII restriction site sites and a stop codon. Further, theXmaI site in SV40 origin of the resulting vector was destroyed bydigestion with XmaI, filling in the overhanging ends of the digestedrestriction site, and religation of the ends.

The following is one exemplary method for constructing a plasmid thatencodes a chimeric zinc finger protein with multiple zinc fingerdomains. First, an insert that encodes a single zinc finger domain wasinserted into a vector (the P3 vector) that harbored a sequence encodinga single zinc finger domain. The result of this cloning is a plasmidthat encodes a zinc finger protein with two zinc finger domains. A zincfinger domain insert consisting of two zinc finger domains was preparedby the above method and cloned into AgeI/NotI-linearized vector P3having one or two zinc finger domains to obtain a plasmid containing azinc finger protein gene consisting of three or four zinc fingerdomains.

Genes encoding chimeric zinc finger proteins were then cloned intopre-prepared plasmids that encode a functional domain, e.g., p65transcriptional activation domain, a Kid transcriptional repressiondomain, or a KOX transcriptional repression domain. The plasmids thatinclude the genes encoding chimeric zinc finger proteins were digestedwith EcoRI/NotI and ligated into plasmids linearized with the sameenzymes. The cloning site in the acceptor plasmids (pLFD-p65, pLFD-KRAB,pLFD-KOX) placed the sequence encoding the zinc finger domains in aposition that results in the DNA binding region being N-terminal to thefunctional domain. The resulting constructs encode a protein thatincludes, from N- to C-terminus: HA-tag, Nuclear localization signal,zinc finger protein and the functional domain.

EXAMPLE 3 In vivo Assays for Three-Fingered Proteins with Human ZincFinger Domains

An in vivo repression assay was used to determine if the newthree-fingered proteins were functional in vivo. See, for example, Kimand Pabo ((1997) J Biol Chem 272:29795-29800). The assay utilized aluciferase reporter construct in which a target site is located at aposition comparable to the position of the Zif268 site in the constructof Kim and Pabo, supra.

The luciferase reporter plasmids were constructed from pAS-modi, amodified version of pGL3-TATA/Inr (Kim and Pabo, supra). These reportersutilize firefly luciferase as the reporter protein. The SacI siteupstream of the TATA box was deleted from pAS-modi. A new SacI site wasinserted following the transcription initiation site. Different reporterplasmids were made for each of the different zinc finger proteins. Toconstruct each plasmid, an oligomer containing a given nine basepairbinding site that is predicts to interact with a particular zinc fingerprotein was inserted into the plasmid. The plasmid pAS-modi was digestedwith SacI and HindIII, and the oligomer was inserted. This manipulationreplaces 14 base pairs at a position 12 basepairs downstream from thetranscription initiation site. The resulting reporter plasmids werenamed p1G-ZFP ID, wherein ID was the name of the corresponding zincfinger protein.

The in vivo activity assay for a particular three-fingered protein wascarried out as follows. HEK 293 cells were transfected with fourplasmids: 14 ng of a plasmid expressing the particular three-fingeredprotein; 14 ng of the reporter plasmid described above; 70 ng of aplasmid that expresses GAL4VP16; and 1.4 ng of a plasmid that expressesRenilla luciferase. The GAL4-VP16 activates transcription of the minimalsynthetic promoter in the reporter absent repression by a particularthree-fingered protein. The ability of different zinc finger proteinswas compared to other three-fingered proteins. The plasmid expressingRenillar luciferase controlled for transfection efficiency.

LIPOFECTAMINE™ (Gibco-BRL) was used for the transfection procedures.Cells were transfected at 30-50% confluency in wells of a 96 well plate.The cells were incubated for two days prior to harvesting for theluciferase assay. Then luciferase activities were measured using theDUAL-LUCIFERASE™ Reporter Assay System (Promega). The observed fireflyluciferase activity was normalized using the observed level of Renillaluciferase. The extent of repression or “fold-repression” was calculatedby dividing a value for normalized reporter expression in the absence ofa zinc finger protein by a value for normalized reporter expression inthe presence of the zinc finger protein.

Zinc finger proteins were classified as satisfying a high stringencycut-off value if they repressed transcription at least 2-fold in thetransfection assay or as satisfying a low stringency cut-off value ifthey repressed between 1.5 and 2-fold in the transfection assay.

EXAMPLE 4 Binding Assay Result of ZFPs with Their Specific Reporter

Gel shift assays were used to correlate activity observed in the in vivoassays to binding affinity. The binding of Zif268 to different targetsequences was evaluated using gel shift assays and the transfectionassay described above in Example 3. A good correlation was observedbetween the dissociation constants measured by gel shift assays and thelevel of transcriptional repression in the transfection assays describedabove. In general, zinc finger proteins exhibiting more than 2-foldrepression (that is, 50% repression) in the transfection assays showed adissociation constant of less than 1 nM as determined by gel shiftassays.

EXAMPLE 5 Characterization of Three-Fingered Proteins

Two types of “three-finger” chimeric zinc finger proteins wereconstructed. One type includes chimeric proteins that are composedexclusively of wild-type human zinc finger domains, i.e., domains thatare identical to naturally-occurring human zinc finger domains. Theother type includes chimeric proteins that include zinc finger domainsthat are not identical to a naturally-occurring zinc finger domain. Thelatter zinc finger domains were typically identified by in vitromutagenesis of a naturally-occurring zinc finger domain followed byphage display selection. Such mutant domains have avoided the scrutinyof natural evolution.

A total of 36 zinc finger domains, 18 human zinc finger domains and 18mutated zinc finger domains, were used to assemble a set of testthree-fingered proteins. The mutated zinc finger domains have beenreported in Choo and Klug (1994) Proc. Natl. Acad. Sci. USA91:11168-11172; Desjarlais and Berg (1994) Proc. Natl. Acad. Sci. USA.91:11099-11103; Dreier et al. (2001) J. Biol. Chem. 276:29466-29478;Dreier et al. (2000) J. Mol. Biol. 303:489-502; Fairall et al. (1993)Nature 366:483-487; Greisman and Pabo (1997) Science. 275:657-661; Kimand Pabo (1997) J. Biol. Chem. 272:29795-29800; and Segal et al. (1999)Proc. Natl. Acad. Sci. USA 96:2758-2763. See also Table 9 of US2003-165997. Nucleic acids encoding the 36 domains were individuallysubcloned into P3 vector digested with EcoRI and NotI, and the resultingplasmids were used as starting material for the chimeric zinc fingerprotein construction.

Nucleic acids encoding chimeric three-fingered proteins were prepared bytwo different methods. In the first method, nucleic acids encoding allthe zinc finger domains were randomly mixed, and three-fingeredconstructs were randomly picked for further analysis. Each construct wassequenced to determine the component zinc finger domains in thepolypeptide that it encodes. Subsequently, target DNA sequences weresynthesized for each randomly assorted three-fingered protein. TargetDNA sequences were based on the expected preferred target site. Thetargets were cloned into the luciferase reporter vector described above.This approach is referred to as “zinc finger protein-first” approach.

In the second method, nucleic acid encoding chimeric three-fingeredproteins were assembled based on a given target DNA sequences. Acomputer algorithm was used to match recognition sites of zinc fingerdomains and target DNA sequences. Promoter sequences of known genes wereused as the input target DNA sequences. The promoter sequences werescanned to identify segments that are nine nucleotides in length andthat are acceptable target sites for recognition by chimericthree-fingered proteins given the available collection of zinc fingerdomains. Once identified, a nucleic acid was constructed that encodedthe chimeric three-fingered proteins. This approach is referred to as“target site-first” approach.

Zinc finger domains that include an aspartate residue at position 2 ofthe base contacting residues were analyzed with special consideration.Such zinc finger domains include RDER1, RDHT, RDNR, RDKR, RDTN, TDKR,and NDTR. The X-ray co-crystal structure of Zif268 bound to DNA showedthat an aspartate at position 2 can form a hydrogen bond with a baseoutside of the 3-basepair subsite recognized by zinc fingers. As aresult, the RDER finger containing an aspartate residue at position 2prefers the 4-basepair site: 5′-GCG (G/T)-3′. The computer algorithmaccounted for this additional specificity. Randomly-assembledthree-fingered proteins that include a finger having aspartate atposition 2 and that violate this rule for the 4-bp site were excluded inother analyses described herein.

A total of 153 three-fingered proteins were constructed from both the“zinc finger protein-first” and the “target site-first” approaches.These proteins were tested using the transient cotransfection assaydescribed in Example 3.

31 of 153 chimeric zinc finger proteins demonstrated greater than 2-foldrepression, the high stringency criterion (RF≧2; RF=fold repression). Ofthe proteins constructed entirely from naturally-occurring human zincfinger domains, 28.1% (27 of 96) exceeded the high stringency criterionand 59.4% exceeded the low stringency criterion (RF≧1.5). Of theproteins constructed from two naturally-occurring zinc finger domainsand one mutated domain, 33.3% exceeded the high stringency criterion,and only 20% exceeded the low stringency criterion.

In contrast, of the 17 proteins constructed from one human domain andtwo mutated domains, only one protein (5.9%) exceeded the highstringency criterion, and only two proteins (11.8%) exceeded the lowstringency criterion. Strikingly, no zinc finger proteins composedexclusively of mutated domains satisfied the high stringency criterionin the repression assay. Only one such protein (4%) satisfied the lowstringency criterion. These results indicate that naturally-occurringhuman zinc finger domains are frequently better building blocks for theconstruction of new DNA-binding proteins than mutated domains.

EXAMPLE 6 Designed Chimeric Zinc Finger Proteins that Bind to the VEGF-AGene

In this example, we designed chimeric zinc finger proteins that bind toDNA elements in the human vascular endothelial growth factor A (VEGF-A)gene. The −950 to +450 region of the VEGF-A promoter was scanned toidentify nine nucleotide sites that are compatible for recognition byavailable combinations of zinc finger domains in a three-fingeredconfiguration.

We constructed several DNA constructs encoding zinc finger proteins thatinclude domains designed to recognize such nine nucleotide sites. Theproteins were expressed in E. coli and purified. We evaluated their DNAbinding specificity using a SELEX (Systematic Evolution of Ligands byEXponential enrichment) experiment. Many zinc finger proteins that weredesigned to target the VEGF-A promoter demonstrated the expectedDNA-binding specificities. Nearly all of the consensus sequencesobtained from the SELEX analyses were identical to the intended targetsequences in the VEGF-A gene. One exemplary zinc finger protein, termedF121, showed a consensus sequence that differed from the intended targetsequence by one base at a position where the corresponding zinc fingershows degeneracy in base recognition.

Transcription factors that include a DNA binding domain with theseartificial zinc fingers were generated by fusing nucleic acids encodingthe three zinc finger domains to a nucleic acid encoding either the p65or VP16 activation domain. The resulting nucleic acid was inserted intoan expression plasmid.

FIG. 4 shows the locations of the DNA binding sites in the VEGF-Apromoter that are recognized by these chimeric zinc finger proteins. Thehuman VEGF-A promoter contains at least two DNase I-hypersensitiveregions. The binding of engineered zinc finger proteins transcriptionfactors to these sites can activate VEGF-A gene expression. F480 wasdesigned to recognize a site at about −633R (“R” designates the reversestrand). F475 was designed to recognize a site at about −455. F435 wasdesigned to recognize a site at about −391R and a site at about −90R.F83 was designed to recognize a site at about +359. F121 was designed torecognize a site at about +434.

We found that regardless of the location of the binding sites, four zincfinger proteins (F480, F475, F121, and F435) that we tested activatednot only a luciferase reporter gene under the control of the VEGF-Apromoter, but also the endogenous VEGF-A gene itself. An ELISA on mediafrom the transiently transfected cells indicated that these chimericzinc finger proteins also up-regulated production of the VEGF-A protein13- to 21-fold.

When two of the chimeric zinc finger proteins, F435 and F121, were fusedindividually to the KRAB repression domain, they each actively repressedVEGF-A expression in HEK 293 cell. Control cells that had beentransfected with the parental expression vector (which contained no zincfinger protein coding sequences) did not show any increase or decreasein VEGF-A mRNA or protein levels.

The protein F83 did not show any effect on the levels of VEGF-A mRNA orprotein in these assays). This may be due to the binding of some otherprotein to the target site or to the local chromatin structure, whichmight have rendered the target DNA inaccessible to the zinc fingerprotein. There was no absolute correlation between the levels of VEGF-Aexpression by these zinc finger proteins and their DNA-bindingaffinities or their expression levels in cells.

To investigate the specificity of zinc finger proteins on a genome-widescale, we performed DNA microarray experiments with 293 cell lines thathad been stably transfected with DNA constructs that encode one of thefollowing three zinc finger transcription factors: F121-p65, F435-p65,and F475-VP16. Fifty-one of 7458 genes were regulated by all three zincfinger transcriptional activator proteins. Forty-nine were up-regulatedmore than two-fold, and two were down-regulated more than two-fold. Mostof these co-regulated genes appear to be closely associated with VEGF-Afunction. Many of them are regulated by VEGF-A, involved in angiogenesisor hypoxia, or expressed in vascular endothelial cells. Therefore, it islikely that these genes are downstream targets of VEGF-A expression. Inaddition, numerous other genes were regulated by one or two of the zincfinger protein activators but not by all three tested proteins. Sincethese zinc finger proteins recognize nine basepairs site, it is possiblethat these zinc finger proteins directly regulate genes other thanVEGF-A, e.g., by binding to identical or related target sites in othergenes. Construction of four, five, or six-fingered proteins may improvespecificity. Taken together, these data indicate that the described zincfinger proteins, which were assembled by shuffling naturally-occurringzinc finger domains, function in cells as transcriptional regulators ofspecific endogenous genes.

For example, a protein described herein may regulate one or more of thefollowing genes: jun B proto-oncogene (N94468), EphA2 (H84481), EphB4(AI261660), fibroblast growth factor receptor 3 (achondroplasia,thanatophoric dwarfism) (AA419620), FK506-binding protein 8 (388 kD)(N95418), protein kinase C, zeta (AA458993), v-erb-b2 avianerythroblastic leukemia viral oncogene homolog 3 (AA664212), lectin,galactoside-binding, soluble, 1 (galectin 1) (AI927284), proteinphosphatase 2, regulatory subunit B (B56), alpha isoform (R59165),insulin-like growth factor 2 (somatomedin A) (N54596), plectin 1,intermediate filament binding protein, 500 kD (AA448400), Periplakin(AI703487), choline kinase (H09959), collagen, type VI, alpha 1(H99676), adaptor-related protein complex 1, sigma 1 subunit (W44558),arrestin, beta 2 (AW009594), GATA-binding protein 2 (H00625),cyclin-dependent kinase inhibitor 1A (p21, Cip1) (AI952615),mitogen-activated protein kinase kinase kinase 11 (R80779),acetylcholinesterase (YT blood group) (AI360141), brain-specificNa-dependent inorganic phosphate cotransporter (AA702627), cellularretinoic acid-binding protein 1 (AA454702), cellular retinoicacid-binding protein 2 (AA598508), cadherin 13, H-cadherin (heart)(R41787), calcium channel, voltage-dependent, beta 3 subunit (R36947),carbonic anhydrase XI (N52089), troponin T1, skeletal, slow (AA868929),gamma-aminobutyric acid (GABA) B receptor, 1 (N70841), adenylate cyclaseactivating polypeptide 1 (pituitary) receptor type I (H09078), solutecarrier family 4, anion exchanger, member 2 (erythrocyte membraneprotein band 3-like 1) (W45518), glypican 1 (AA455896), protein Cinhibitor (plasminogen activator inhibitor III) (W86431),cyclin-dependent kinase inhibitor 1C (p57, Kip2) (AI828088), zinc fingerprotein 43 (HTF6) (AA773894), zinc finger protein homologous to Zfp-36in mouse (R38383), Meis (mouse) homolog 3 (AA703449), SWI/SNF related,matrix associated, actin dependent regulator of chromatin, subfamily d,member 3 (AA053810), ( ), unknown (R11526), unknown (AA045731), unknown(T51849), unknown (T50498), putative gene product (H09111), B/K protein(H23265), damage-specific DNA binding protein 2 (48 kD) (AA410404),dihydropyrimidinase-like 4 (AA757754), N-methylpurine-DNA glycosylase(N26769), protein tyrosine phosphatase, receptor type, N (R45941),fasciculation and elongation protein zeta 1 (zygin I) (H20759),lanosterol synthase (2,3-oxidosqualene-lanosterol cyclase) (AA437389), (), ( ), spennidine/spermine N1-acetyltransferase (AA011215), and adisintegrin-like and metalloprotease (reprolysin type) withthrombospondin type 1 motif, 1 (T41173). The expression of these proteinor the genes that encode them can be regulated at least 0.5, 1.0, 2, 5,10, or 50-fold, e.g., between 2 and 80-fold.

Exemplary sites in the VEGF promoter and proteins that can recognizethem include: TABLE 7 VEGF-A Promoter Sites (A) Protein Site SequenceF475 −455 GAG CGG GGA F121 +434 TGG GGG TGA F435 −90R GGG CGG GGA F547−665 AAT AGG GGG F2825 +434 TGG GGG TGA

TABLE 8 VEGF-A Promoter Sites (B) Protein Site Sequence F480 −633R GGGTGG GGG F435 −391R GGG TGG GGA F2828 +435 GGG GGT GAC F625 +435 GGG GGTGAC F2830 +435 GGG GGT GAC F2838 +435 GGG GGT GAC

TABLE 9 VEGF-A Promoter Sites (C) SEQ ID Protein Site Sequence NO: F2604−680 GTT TGG GAG GTC 76 F2605 −677 TGG GAG GTC AGA 77 F2607 −671 GTC AGAAAT AGG 78 F2615 −606 GCC AGA GCC GGG 79 F2633 −455 GAG CGG GGA GAA 80F2634 −395R GGG GAG AGG GAC 81 F2636 −393R GTG GGG AGA GGG 82 F2644−358R GGG GCA GGG GAA 83 F2646 −314R GAC AGG GCC TGA 84 F2650 −206 GGTGGG GGT CGA 85 F2679 +244R CAA GTG GGG AAT 86

TABLE 10 VEGF-A Promoter Sites (D) SEQ ID Protein Site Sequence NO:F2610 −633R GGG TGG GGG GAG 87 F2612 −630R AGG GGG TGG GGG 88 F2638−391R GGG TGG GGA GAG 89

TABLE 11 VEGF-A Promoter Sites (E) SEQ ID Protein Site Sequence NO: F109 536B GAG CGA GCA GCG 90 F2608 −668 AGA AAT AGG GGG 91 F2611 −631R GGGGGT GGG GGG 92 F2617 −603 AGA GCC GGG GTG 93 F2619 −554 AGG GAA GCT GGG94 F2623 −495 GTG GGT GAG TGA 95 F2625 −475 GTG TGG GGT TGA 96 F2628−468 GTT GAG GGT GTT 97 F2629 −465 GAG GGT GTT GGA 98 F2630 −462 GGT GTTGGA GCG 99 F2634 −395R GGG GAG AGG GAC 100 F2635 −394R TGG GGA GAG GGA101 F2637 −392R GGT GGG GAG AGG 102 F2642 −385R AGG GAC GGG TGG 103F2643 −382R GAC AGG GAC GGG 104 F2648 −282R GAG GAG GGA GCA 105 F2651−203 GGG GGT CGA GCT 106 F2653 −184R GAA GGG GAA GCT 107 F2654 −181R AATGAA GGG GAA 108 F2662 −124R GCG GCT CGG GCC 109 F2667 −85 GGG CGG GCCGGG 110 F2668 −30R AAA AAA GGG GGG 111 F2673 +77 GCA GCG GTT AGG 112F2682 +283R GGG GAA GTA GAG 113 F2689 +342 AGA GAA GTC GAG 114 F2697+357 GAG AGA GAC GGG 115 F2699 +366 GGG GTC AGA GAG 116 F2703 −632R GGGGTG GGG GGA 117 F2702 +474R CAA GGG GGA GGG 118Construction of a Yeast Expression Plasmid for a Zinc Finger Library

We constructed an expression plasmid encoding a zinc fingertranscription factor by modification of pPC86 (Chevray and Nathans(1992) Proc. Natl. Acad. Sci. USA 89:5789-5793). A gene encoding theZif268 zinc finger protein was inserted between the SalI and EcoRI sitesof pPC86 to generate pPCFM-Zif, in which the Gal4 activation domain isfused to the Zif268 domain. pPCFM-Zif was used as a vector forconstructing libraries of zinc fingers. To construct human zinc fingerlibraries, DNA segments encoding zinc fingers were amplified from humangenomic DNA using the polymerase chain reaction (PCR) (Promega, Madison,Wis.) and mixtures of degenerate PCR primers with the sequenceHis-Thr-Gly-Glu/Gln-Lys/Arg-Pro-Tyr/Phe, which is frequently found atthe junction between zinc fingers in naturally-occurring zinc fingerproteins. The 100-bp PCR products encoding the zinc fingers weredigested with SacII and AvaI and inserted into pPCFM-Zif, which encodeshybrid transcription factors consisting of finger 1 and finger 2 ofZif268 and a zinc finger domain derived from the human genome (togetherforming three-fingered protein). The plasmid library was prepared from atotal of 1.2×10⁶ E. coli transformants.

Reporter plasmids were prepared by inserting one of 64 pairs ofcomplementary oligonucleotides that contained three copies of a 9-bptarget sequence into pRS315(His) and pLacZi (Clontech, Palo Alto,Calif.).

Gap Repair Cloning of Human Zinc Finger Domains Selected from the HumanGenome

Gap repair cloning of DNA sequences that encode individual zinc fingerdomains was carried out as described (Hudson et al. (1997) Genome Res.7:1169-1173). To clone a DNA segment that encode a zinc finger, twooverlapping oligonucleotides were synthesized. Each oligonucleotideincluded a 21-bp common tail at its 3′ end for a second round of PCR aswell as a specific sequence that can anneal to the nucleic acid sequencethat encodes the individual zinc finger domain. DNA sequences encodingzinc fingers were amplified from human genomic DNA with an equimolarmixture of two corresponding oligonucleotides.

Amplification products from the initial round of PCR were used astemplates in a second round of PCR. The primers for the second round ofPCR had two regions, one identical to a segment of pPCFM-Zif and anotheridentical to the 21-bp common tail. A mixture of the second-round PCRproducts and linearized pPCFM-Zif that had been digested with MscI andEcoRI were transformed into the yW1(MATα Δgal4 Δgal80 lys2801 his3-Δ200trp1-Δ63 leu2 ade2-101CYH2) yeast strain. A total of 823 human zincfingers were cloned by this method. Many were used in our in vivoselection systems described herein.

In Vivo Selection of Zinc Finger Domains

Yeast mating was used to facilitate identification of zinc fingers thatbind to each three basepair target site. The zinc finger library wasintroduced into the yW1 (MATα) strain, and ˜1.47×10⁶ independenttransformed yeast colonies were generated. Aliquots of these transformedcells were mated for 5 h at 30° C. with the haploid yeast strain yW1a(MATa), which contained the 64 reporter plasmids in each of two sets(one for each of the reporter genes). The reporter plasmids containedthree copies of the target DNA sequences adjacent to the coding regionsof either the LacZ or HIS3 genes. The resulting diploids were plated onselective media that contained X-gal (40 μg/ml) and 3-amino triazole(3-AT) (1 mM) but lacked histidine. Plasmids isolated from blue(positive) colonies were re-transformed to confirm the results andsequenced to identify their encoding zinc finger domains. The bindingaffinity and specificity of each zinc finger fused to fingers 1 and 2 ofZif268 were determined both in yeast and by EMSA. These methods aredescribed below.

Construction of Three-Fingered Proteins Using Selected Zinc Fingers asModular Building Blocks

A modified version of the pcDNA3 (Invitrogen, Carlsbad, Calif.) vector(P3) was used as a parental vector for expressing zinc finger proteinsin mammalian cells. P3 contains an HA tag and a nuclear localizationsignal, both of which were inserted 3′ to the initiation codon. DNAsegments that encode individual zinc finger domains were subcloned intothe P3 vector between the EcoRI and NotI sites, and the resultingplasmids were used as starting material for chimeric zinc finger proteinconstruction. New three-fingered proteins were prepared by two differentmethods. In the first method, all the zinc fingers were mixed, andassembled three-fingered constructs were randomly chosen for furtheranalysis. In the second method, new three-fingered proteins weredesigned to target specific DNA sequences. To this end, we used a simplecomputer algorithm that finds a match between recognition sites of zincfingers and target DNA sequences. We used promoter sequences of knowngenes as the input DNA sequences and identified three-fingered proteinsthat should bind to nine basepair DNA elements within the inputsequences.

Zinc finger proteins that target the VEGF-A gene were constructed bythis method. The constructed zinc finger proteins were tested for theirDNA binding ability and affinity in mammalian cells as describedpreviously. Kim and Pabo (1997) J. Biol. Chem. 272, 29795-29800; Kim andPabo (1998) Proc. Natl. Acad. Sci. USA 95, 2812-2817; and Kang and Kim(2000) J. Biol. Chem. 275:8742-8748. The reporter plasmid for the assaywas constructed using pGL3-TATA/Inr which harbors the firefly luciferasegene as the reporter.

To connect functional domains to the zinc finger proteins, thetranscriptional activation domain of p65 (amino acids 288-548) and VP16(amino acids 413-490) were amplified by PCR using pairs of specificoligomers, and the PCR products for p65 and VP16 were cloned separatelyinto P3 to generate pLFD-p65 and pLFD-VP16, respectively. Nucleic acidsthat encode zinc finger proteins that target the VEGF-A promoter wereinserted into pLFD-p65 or VP16 to express zinc finger protein-activationdomain (AD) fusions proteins (ZFP-AD). Real-time PCR, ELISA, andmicroarray analyses were carried out to determine whether these ZFP-ADsactivate the VEGF-A gene. In addition, SELEX was performed to testwhether these proteins recognize the appropriate target DNA sequences.See below.

Binding Affinity and Specificity of Human Zinc Finger Domains

Plasmids isolated from blue yeast colonies (see section entitled “Invivo selection of zinc finger domains”) were individually retransformedinto yW1 cells. For each isolated plasmid, re-transformed yW1 cells weremated to yW1a cells that contained each of the 64 LacZ reporterplasmids. The resulting cells were then spread onto minimal media thatcontained X-gal and histidine but lacked tryptophan and uracil. Usingthe GEL-DOC™ system (Bio-Rad, Hercules, Calif.), we measured theintensity of the blue color for each colony to determine the DNA-bindingaffinities and specificities of each of the zinc finger domains thatwere fused to fingers 1 and 2 of Zif268. Control experiments with theZif268 protein indicated that positive interactions between a zincfinger domain and a target binding site in the promoter of the LacZreporter yielded dark to pale blue colonies (the blue intensity isproportional to the binding affinity) and that negative interactionsyielded white colonies.

Electrophoretic Mobility Shift Assay (EMSA)

DNA segments that encode zinc finger proteins were isolated by digestionwith SalI and NotI, and were inserted into pGEX-4T2 (Amersham Pharmacia,Uppsala, Sweden). Zinc finger proteins were expressed in E. coli strainBL21(DE3) as fusion proteins linked to glutathione-S-transferase (GST).The fusion proteins were purified using glutathione affinitychromatography (Amersham Pharmacia) and then digested with thrombin.This cleavage event severs the connection between the GST moiety and thezinc finger proteins. In this case, purified zinc finger proteinscontained fingers 1 and 2 of Zif268 fused to selected zinc fingers inposition 3 at the C-terminus. Probe DNAs were synthesized, annealed,labeled with ³²p using T4 polynucleotide kinase, and EMSAs were carriedout as described. Kim and Pabo (1997) J. Biol. Chem. 272, 29795-29800and Kim and Pabo (1998) Proc. Natl. Acad. Sci. USA 95, 2812-2817. Thesame procedure can be used to test other zinc finger proteins.

Transcriptional Regulation of Endogenous VEGF

Human embryonic kidney 293 cells were maintained in Dulbecco's modifiedEagle medium (DMEM) supplemented with 100 units/ml penicillin, 100 μg/mlstreptomycin, and 10% fetal bovine serum (FBS). For the luciferaseassay, 10⁴ cells/well were pre-cultured in a 96-well plate. Using aLIPOFECTAMINE™ transfection kit (Life Technologies, Rockville, Md.), 293cells were transfected with 25 ng of a reporter plasmid in which thenative VEGF-A promoter was fused to the luciferase gene in pGL3-basic(Promega), and 25 ng of a plasmid encoding a zinc finger protein. After48 h of incubation, luciferase activity was measured with a DUALLUCIFERASE™ assay kit (Promega) using a TD-20/20 luminometer (TurnerDesigns Inc., Sunnyvale, Calif.).

For reverse transcriptase-PCR (RT-PCR) analyses and ELISA, 10⁵cells/well were pre-cultured in 1 ml of culture medium (supplementedwith 10% FBS but deprived of antibiotics) in a 12-well culture plate for24 h at 37° C. in a humid atmosphere containing 5% CO₂. The cells werethen transfected with DNA using a LIPOFECTAMINE™ transfection kit (LifeTechnologies). Briefly, 1 μg of a plasmid encoding a zinc finger proteinwas added to 5 μl plus reagent in a total of 50 μl DMEM, and thissolution was then mixed with another 50 μl of DMEM containing 2 μl ofLIPOFECTAMINE™ reagent. After 15 min of incubation, the entire 100 μlmixtures were added to cells in a culture plate, and the cells weregrown for an additional 48 h. The cells and culture supernatants wereharvested for RT-PCR analysis and ELISA.

Quantitative RT-PCR

Total cellular RNA was extracted from TRIZOL™-lysates according to themanufacturer's instructions (Life Technologies). The reversetranscription reactions were performed with 4 μg total RNA usingoligo-dT as the first-strand synthesis primer for mRNA and the MMLVreverse transcriptase provided in the SUPERSCRIPT™ first-strandsynthesis system (Life Technologies). To analyze mRNA quantities, 1 μlof the first-strand cDNAs generated from the RT reactions were amplifiedusing VEGF-A-specific primers. The initial amounts of RNA werenormalized to glyceraldehydes-3-phosphate dehydrogenase (GAPDH) mRNAconcentrations that had been calculated by specific amplification usingGAPDH-specific primers. The amplification of VEGF-A and GAPDH-specificcDNAs was monitored and analyzed in real-time with a QUANTITECT SYBR™kit (QIAGEN, Valencia, Calif.) and ROTORGENE™ 2000 real-time cycler(Corbett, Sydney, Australia) and was quantified using serial dilution ofthe standards included in the reactions.

ELISA

The kidney 293 cell culture supernatants were briefly centrifuged for 5min to remove cells and cell debris. VEGF protein that accumulated inthe culture medium (100 ul each) and dilutions of a recombinant humanVEGF-A protein standards were analyzed using sandwich ELISA (enzymelinked immunosorbent assay), wherein the supernatant of culture wasreacted with a anti-human VEGF antibody (R&D systems; AF-293-NA),biotinylated anti-human VEGF antibody (R&D systems; BAF293),streptavidin alkaline phosphatase. The antigen-antibody complex wasreacted with pNPP (p-Nitrophenyl phosphate) dissolved in pNPP buffer(Chemicon; ES011). VEGF-A concentrations in the samples were determinedfrom the absorbance at 405 nm which was measured with POWERWAVE™X340(Bio-TEK Instrument Inc., Winooski Vt.).

DNA Microarray Analysis of FlpTRex-293 Cell Lines Stably Expressing ZincFinger Proteins

Plasmids encoding ZFPs designed to target the VEGF-A promoter werestably introduced into FlpTRex-293 cell lines (Invitrogen) essentiallyas described in the manufacturer's protocol. Briefly, the HindIII-XhoIfragment from a pLFD-p65 or a pLFD-VP16 vector that contained DNAsegments encoding zinc finger proteins was subcloned into pCDNA5/FRT/TO(Invitrogen). The resulting plasmids were cotransfected with pOG44(Invitrogen) into FlpTRex-293 cells, and stable integrants werescreened. The resulting cell lines express ZFP-p65 or ZFP-VP16 upondoxycycline induction.

DNA microarrays containing 7458 human expressed sequence tag (EST)clones were provided by Genomic Tree, Inc. (Taejon, Korea). FlpTRex-293cells stably expressing ZFP-p65 or ZFP-VP16 were grown with (+Dox) orwithout (−Dox) 1 μg/ml Doxycycline for 48 h. Total RNA was prepared fromeach sample. RNA from a -Dox sample was used as the reference (Cy3).Microarray experiments were performed according to the manufacturer'sprotocol.

SELEX of Assembled Zinc Finger Proteins

A template oligonucleotide was designed to contain a random20-nucleotide region flanked, on both sides, by invariant sequences. Inaddition, two primers that were complementary to the invariant regionsof the template oligonucleotide were designed for the PCR amplification.The template oligonucleotide was converted to double-stranded DNA byKlenow fragment extension from one of the primers. For enrichment of thetarget sequences bound by zinc finger proteins, 100 μg of the GST-fusionproteins was mixed with 10 pmol of double-stranded template DNA in 100μl of binding buffer (25 mM Hepes pH 7.9, 40 mM KCl, 3 mM MgCl₂, 1 mMDTT) for one hour at room temperature. GST-resin (10 μl) was then addedto the mixture. After incubation for 30 min at room temperature, theresin was washed three times with binding buffer containing 2.5% skimmilk.

The bound double-stranded template oligomers were dissociated byincubating the resins with 100 μl of 1 M KCl for 10 min at roomtemperature. After PCR amplification of the rescued double-strandedtemplate oligomers, a new round of SELEX was repeated. This process wasrepeated eight times. The final PCR product was digested with XbaI andBamHI and inserted into pBLUESCRIPT™ KS digested with the same enzymes.The DNA sequences of at least eight individual inserts per zinc fingerprotein were determined.

EXAMPLE 7 Sequences of Exemplary Proteins

The following are the amino acid sequences of the DNA binding regions ofexemplary proteins that can regulate VEGF-A: TABLE 12 Amino AcidSequences of DNA Binding Domains of Exemplary Proteins SEQ ID Name AminoAcid Sequence NO: F475 YKCGQCGKFY SQVSHLTRHQ KIHTGEKPFQ CKTCQRKFSR 20SDHLKTHTRT HTGEKPYICR KCGRGFSRKS NLIRHQRTHT GEK F121 YKCEECGKAFRQSSHLTTHK IIHTGEKPYK CMECGKAFNR 21 RSHLTRHQRI HTGEKPFQCK TCQRKFSRSDHLKTHTRTHT GEK F435 YKCGQCGKFY SQVSHLTRHQ KIHTGEKPFQ CKTCQRKFSR 22SDHLKTHTRT HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEK F547 YKCMECGKAFNRRSHLTRHQ RIHTGEKPFQ CKTCQRKFSR 23 SDHLKTHTRT HTGEKPYECD HCGKAFSVSSNLNVHRRIHT GEK F2825 YECDHCGKSF SQSSHLNVHK RTHTGEKPFL CQYCAQRFGR 24KDHLTRHMKK SHTGEKPFQC KTCQRKFSRS DHLKTHTRTH TGEK F480 YKCMECGKAFNRRSHLTRHQ RIHTGEKPFQ CKTCQRKFSR 25 SDHLKTHTRT HTGEKPYKCM ECGKAFNRRSHLTRHQRIHT GEK F2828 YKCKQCGKAF GCPSNLRRHG RTHTGEKPYR CEECGKAFRW 26PSNLTRHKRI HTGEKPFLCQ YCAQRFGRKD HLTRHMKKSH TGEK F625 YKCKQCGKAFGCPSNLRRHG RTHTGEKPYR CEECGKAFRW 27 PSNLTRHKRI HTGEKPYKCM ECGKAFNRRSHLTRHQRIHT GEK F2830 YRCKYCDRSF SDSSNLQRHV RNIHTGEKPY RCEECGKAFR 28WPSNLTRHKR IHTGEKPFLC QYCAQRFGRK DHLTRHMKKS HTGEK F2838 YRCKYCDRSFSDSSNLQRHV RNIHTGEKPY RCEECGKAFR 29 WPSNLTRHKR IHTGEKPYKC MECGKAFNRRSHLTRHQRIH TGEK F2604 YSCGICGKSF SDSSAKRRHC ILHTGEKPYI CRKCGRGFSR 30KSNLIRHQRT HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYTCKQC GKAFSVSSSLRRHETTHTGE K F2605 YKCEECGKAF RQSSHLTTHK IIHTGEKPYS CGICGKSFSD 31SSAKRRHCIL HTGEKPYICR KCGRGFSRKS NLIRHQRTHT GEKPFQCKTC QRKFSRSDHLKTHTRTHTGE K F2607 FQCKTCQRKF SRSDHLKTHT RTHTGEKPYE CDHCGKAFSV 32SSNLNVHRRI HTGEKPYKCE ECGKAFRQSS HLTTHKIIHT GEKPYSCGIC GKSFSDSSAKRRHCILHTGE K F2615 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYT CSDCGKAFRD 33KSCLNRHRRT HTGEKPYKCE ECGKAFRQSS HLTTHKIIHT GEKPYTCSDC GKAFRDKSCLNRHRRTHTGE K F2633 YECEKCGKAF NQSSNLTRHK KSHTGEKPYK CGQCGKFYSQ 34VSHLTRHQKI HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYICRKC GRGFSRKSNLIRHQRTHTGE K F2634 YKCKQCGKAF GCPSNLRRHG RTHTGEKPFQ CKTCQRKFSR 35SDHLKTHTRT HTGEKPYICR KCGRGFSRKS NLIRHQRTHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2636 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYK CEECGKAFRQ 36SSHLTTHKII HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEKPYVCDVE GCTWKFARSDELNRHKKRHT GEK F2644 YECEKCGKAF NQSSNLTRHK KSHTGEKPYK CMECGKAFNR 37RSHLTRHQRI HTGEKPYKCP DCGKSFSQSS SLIRHQRTHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2646 YKCEECGKAF RQSSHLTTHK IIHTGEKPYT CSDCGKAFRD 38KSCLNRHRRT HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYKCKQC GKAFGCPSNLRRHGRTHTGE K F2650 YKCEECGKAF RQSSHLTTHK IIHTGEKPYR CEECGKAFRW 39PSNLTRHKRI HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEKPYRCEEC GKAFRWPSNLTRHKRIHTGE K F2679 YECDHCGKAF SVSSNLNVHR RIHTGEKPYK CMECGKAFNR 40RSHLTRHQRI HTGEKPYVCD VEGCTWKFAR SDELNRHKKR HTGEKPYVCS KCGKAFTQSSNLTVHQKIHT GEK F2610 YICRKCGRGF SRKSNLIRHQ RTHTGEKPYK CMECGKAFNR 41RSHLTRHQRI HTGEKPFQCK TCQRKFSRSD HLKTRTRTHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2612 YKCMECGKAF NRRSHLTRHQ RIHTGEKPFQ CKTCQRKFSR 42SDHLKTHTRT HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEKPFQCKTC QRKFSRSDHLKTHTRTHTGE K F2638 YICRKCGRGF SRKSNLIRHQ RTHTGEKPYK CGQCGKFYSQ 43VSHLTRHQKI HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F109 YVCDVEGCTW KFARSDELNR HKKRHTGEKP YKCPDCGKSF 44SQSSSLIRHQ RTHTGEKPYK CEECGKAFRQ SSHLTTHKII HTGEKPYICR KCGRGFSRKSNLIRHQRTHT GEK F2608 YKCMECGKAF NRRSHLTRHQ RIHTGEKPFQ CKTCQRKFSR 45SDHLKTHTRT HTGEKPYECD HCGKAFSVSS NLNVHRRIHT GEKPYKCEEC GKAFRQSSHLTTHKIIHTGE K F2611 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYK CMECGKAFNR 46RSHLTRHQRI HTGEKPYRCE ECGKAFRWPS NLTRHKRIHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2617 YVCDVEGCTW KFARSDELNR HKKRHTGEKP YKCMECGKAF 47NRRSHLTRHQ RIHTGEKPYT CSDCGKAFRD KSCLNRHRRT HTGEKPYKCE ECGKAFRQSSHLTTHKIIHT GEK F2619 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYE CNYCGKTFSV 48SSTLIRHQRI HTGEKPYECE KCGKAFNQSS NLTRHKKSHT GEKPFQCKTC QRKFSRSDHLKTHTRTHTGE K F2623 YKCEECGKAF RQSSHLTTHK IIHTGEKPYI CRKCGRGFSR 49KSNLIRHQRT HTGEKPYRCE ECGKAFRWPS NLTRHKRIHT GEKPYVCDVE GCTWKFARSDELNRHKKRHT GEK F2625 YKCEECGKAF RQSSHLTTHK IIHTGEKPYR CEECGKAFRW 50PSNLTRHKRI HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYVCDVE GCTWKFARSDELNRHKKRHT GEK F2628 YTCKQCGKAF SVSSSLRRHE TTHTGEKPYR CEECGKAFRW 51PSNLTRHKRI HTGEKPYICR KCGRGFSRKS NLIRHQRTHT GEKPYTCKQC GKAFSVSSSLRRHETTHTGE K F2629 YKCGQCGKFY SQVSHLTRHQ KIHTGEKPYT CKQCGKAFSV 52SSSLRRHETT HTGEKPYRCE ECGKAFRWPS NLTRHKRIHT GEKPYICRKC GRGFSRKSNLIRHQRTHTGE K F2630 YVCDVEGCTW KFARSDELNR HKKRHTGEKP YKCGQCGKFY 53SQVSHLTRHQ KIHTGEKPYT CKQCGKAFSV SSSLRRHETT HTGEKPYRCE ECGKAFRWPSNLTRHKRIHT GEK F2635 YKCGQCGKFY SQVSHLTRHQ KIHTGEKPYI CRKCGRGFSR 55KSNLIRHQRT HTGEKPYKCG QCGKFYSQVS HLTRHQKIHT GEKPFQCKTC QRKFSRSDHLKTHTRTHTGE K F2637 FQCKTCQRKF SRSDHLKTHT RTHTGEKPYI CRKCGRGFSR 56KSNLIRHQRT HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEKPYRCEEC GKAFRWPSNLTRHKRIHTGE K F2642 FQCKTCQRKF SRSDHLKTHT RTHTGEKPYK CMECGKAFNR 57RSHLTRHQRI HTGEKPYKCK QCGKAFGCPS NLRRHGRTHT GEKPFQCKTC QRKFSRSDHLKTHTRTHTGE K F2643 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYK CKQCGKAFGC 58PSNLRRHGRT HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYKCKQC GKAFGCPSNLRRHGRTHTGE K F2648 YKCPDCGKSF SQSSSLIRHQ RTHTGEKPYK CGQCGKFYSQ 59VSHLTRHQKI HTGEKPYICR KCGRGFSRKS NLIRHQRTHT GEKPYICRKC GRGFSRKSNLIRHQRTHTGE K F2651 YECNYCGKTF SVSSTLIRHQ RIHTGEKPYK CEECGKAFRQ 60SSHLTTHKII HTGEKPYRCE ECGKAFRWPS NLTRHKRIHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2653 YECNYCGKTF SVSSTLIRHQ RIHTGEKPYE CEKCGKAFNQ 63.SSNLTRHKKS HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEKPYECEKC GKAFNQSSNLTRHKKSHTGE K F2654 YECEKCGKAF NQSSNLTRHK KSHTGEKPYK CMECGKAFNR 62RSHLTRHQRI HTGEKPYECE KCGKAFNQSS NLTRHKKSHT GEKPYECDHC GKAFSVSSNLNVHRRIHTGE K F2662 YTCSDCGKAF RDKSCLNRHR RTHTGEKPFQ CKTCQRKFSR 63SDHLKTHTRT HTGEKPYECN YCGKTFSVSS TLIRHQRIHT GEKPYVCDVE GCTWKFARSDELNRHKKRHT GEK F2667 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYT CSDCGKAFRD 64KSCLNRHRRT HTGEKPFQCK TCQRKFSRSD HLKTHTRTHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2668 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYK CMECGKAFNR 65RSHLTRHQRI HTGEKPYVCS KCGKAFTQSS NLTVHQKIHT GEKPYVCSKC GKAFTQSSNLTVHQKIHTGE K F2673 FQCKTCQRKF SRSDHLKTHT RTHTGEKPYT CKQCGKAFSV 66SSSLRRHETT HTGEKPYVCD VEGCTWKFAR SDELNRHKKR HTGEKPYKCP DCGKSFSQSSSLIRHQRTHT GEK F2682 YICRKCGRGF SRKSNLIRHQ RTHTGEKPYK CPDCGKSFSQ 67SSSLIRHQRT HTGEKPYECE KCGKAFNQSS NLTRHKKSHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2689 YICRKCGRGF SRKSNLIRHQ RTHTGEKPYS CGICGKSFSD 68SSAKRRHCIL HTGEKPYECE KCGKAFNQSS NLTRHKKSHT GEKPYKCEEC GKAFRQSSHLTTHKIIHTGE K F2697 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYK CKQCGKAFGC 69PSNLRRHGRT HTGEKPYKCE ECGKAFRQSS HLTTHKIIHT GEKPYICRKC GRGFSRKSNLIRHQRTHTGE K F2699 YICRKCGRGF SRKSNLIRHQ RTHTGEKPYK CEECGKAFRQ 70SSHLTTHKII HTGEKPYSCG ICGKSFSDSS AKRRHCILHT GEKPYKCMEC GKAFNRRSHLTRHQRIHTGE K F2703 YKCGQCGKFY SQVSHLTRHQ KIHTGEKPYK CMECGKAFNR 71RSHLTRHQRI HTGEKPYVCD VEGCTWKFAR SDELNRHKKR HTGEKPYKCM ECGKAFNRRSHLTRHQRIHT GEK F2702 YKCMECGKAF NRRSHLTRHQ RIHTGEKPYK CGQCGKFYSQ 54VSHLTRHQKI HTGEKPYKCM ECGKAFNRRS HLTRHQRIHT GEKPYVCSKC GKAFTQSSNLTVHQKIHTGE K

A polypeptide, e.g., that includes a sequence described above, alsoinclude a tag (e.g., the HA tag), a NLS, a linker, and a regulatorydomain (e.g., an activation or repression domain). These elements can bearrange in any order, from N- to C-terminus. In one example, thepolypeptide is arranged as follows: HA tag-NLS-PGEKP-DNA binding domain(e.g., a sequence described above)-AAA-p65. Or more particularly:

MVYPYDVPDYAELPPKKKRKVGIRIPGEKP-DNA_BIDING_DOMAIN-AAA-p65; (wherein theleader N-terminal to the DNA binding domain is SEQ ID NO:126)

-   -   YPYDVPDYA (3-12 of SEQ ID NO:126) is an exemplary tag (ere the        HA-tag)    -   PPKKKRKV (15-21 of SEQ ID NO:126) is an exemplary NLS (Nuclear        localization signal)    -   “ZFP” is an array of zinc finger domains

In another example, the polypeptide includes the DNA binding domain anda repression domain, e.g., a KRAB or KOX domain.

Nucleic acid encoding a polypeptide described in this example can beproducing using any choice of codons, e.g., codons useful (e.g.,optimized) for prokaryotic expression, codons useful (e.g., optimized)for eukaryotic expression, or codons that encode corresponding naturallyoccurring domains.

Results indicate that a number of zinc finger can activate VEGF-Aproduction. TABLE 13 VEGF-A Activation Fold ZFP ID Activation F109 3.5F121 4.4 F435 12.5 F475 11.1 F480 9.2 F625 9.0 F2604 4.9 F2605 10.9F2607 5.4 F2608 2.1 F2610 7.4 F2612 6.3 F2615 8.1 F2617 2.3 F2619 2.3F2623 2.3 Irrelevant 1.1 ZFP F2625 2.1 F2628 1.8 F2629 3.8 F2630 2.0F2633 11.9 F2634 6.5 F2635 2.8 F2636 13.3 F2638 5.7 F2642 3.6 F2643 3.6F2644 10.7 F2646 10.2 F2648 2.3 F2650 12.6 F2651 4.9 Parental 1.0 vectorP3 F2653 2.6 F2654 2.3 F2668 2.1 F2673 3.1 F2679 4.5 F2682 3.4 F2689 2.1F2697 1.9 F2699 1.9 F2702 3.1 F2703 4.5 F2825 2.8 F2828 8.8 F2830 6.8F2838 5.8

EXAMPLE 8 VEGF-A Production by an Encapsulated Cells

A nucleic acid construct that includes a coding region encoding theF435-p65 zinc finger protein operably linked to a doxycycline-induciblepromoter was stably transfected into Flp-T-Rex293 cells.

Human embryonic kidney (HEK) cell lines stably expressing ZFP-TFs weregenerated as follows: Plasmids encoding ZFP-TFs were stably introducedinto FlpTRex-293 cell lines (Invitrogen) essentially as described in themanufacturer's protocol. Briefly, the HindIII.XhoI fragment from thepLFD-p65 vectors, which contain DNA segments that encode ZFP-TFs, weresubcloned into pcDNA5/FRT/TO (Invitrogen). The resulting plasmids werecotransfected along with pOG44 (Invitrogen) into Flp-In™ TRex™-293 cellsto induce a site-specific integration event. Stable integrants were thenscreened. The resulting cell lines expressed zinc finger proteinconditionally upon the addition of Doxycyclin (1 ug/ml) into culturemedium. To immobilize the cells stably expressing F435-p65, SodiumAlginate (Sigma) was dissolved in PBS to a final concentration of 1%(wt/v) and gently mixed with cells to a final density of 10⁶ (cells/ml).The suspension of cells were added dropwise into CaCl₂ (100 mM) wherecellular capsules were solidified for 15 min, then were washed in PBS.The encapsulated cells were added to culture medium. Expression wasinduced with 1 μg/ml doxycycline and the amount of VEGF-A produced bythe encapsulated cells was measured. In one experiment with F435-p65,the cells (cell line #151) grown in the presence of doxycycline producedat least 600 pg/mL of VEGF-A after 2 days, at least 4000 pg/mL afterthree days, about 5000 pg/mL at four days, and at least 5300 pg/mL atfive days. VEGF-A production was at least 5, 10, 50, or 100 fold greaterthan controls that did not include the F435-p65 zinc finger protein orcells that were not grown in the presence of doxycycline.

EXAMPLE 9 Cell-Based Assay for Human VEGF-A Expression

The 3×10⁴ HEK293T cells were transfected with 100 ng of each pLFD-4F-p65plasmid in 96-culture plates precoated with poly-L-lysine (Biocoat). Theculture supernatants were harvested at 48 hours post transfection andstored immediately at −80° C. until they were used. The transfectionefficiency was estimated at a well of each plate transfected with 100 ngof lacZ, by staining with X-gal. The calculated transfectionefficiencies varied in a range of 70-80% in each experiment.

The production of VEGF-A was analyzed by measuring secreted VEGF-Aprotein by sandwich ELISA. The capture antibody (AF-293-NA from R&DSystems), biotinylated detection antibody (BAF293 from R&D Systems) werepurchased from R&D systems, streptavidin-AP (SA110) and substrate buffer(ES011) from Chemicon, substrate pNPP (N-9389) from Sigma Aldrich. TheELISA procedures were carried out with automated workstation (GENESISRSP 150™, TECAN). The optical density (OD) at 405 nm was measured(POWERWAVE™ X340, BioTek Instrument Inc.) and the quantity of VEGF-A wascalculated from standard curve obtained from the OD values of seriallydiluted recombinant human VEGF-A protein (R&D systems). Relative VEGF-Aproduction was calculated by normalizing VEGF-A concentrations obtainedfrom cultures individually transfected with pLFD-4F-p65 to that obtainedfrom cultures transfected with the parental vector p3.

EXAMPLE 10 Cell-Based Assay for Human VEGF-A Expression

The zinc finger protein F121 consisted of three human zinc fingerdomains designed to bind 9 bp sequences of human VEGF promoter at aboutnucleotide +434 relative to the transcription initiation site of humanVEGF-A gene; F109 consisted of four human zinc finger domains designedto bind a 12 bp sequence of human VEGF promoter at about the −536nucleotide relative to the transcription initiation site of human VEGF-Agene; and F435 consisted of three human zinc finger domains designed tobind 9 bp sequences at the positions −90R and −391R (wherein R meansreverse strand) of human VEGF-A gene.

Construction of Luciferase Reporter Plasmids Containing Human VEGFPromoter

The native human VEGF promoter DNA (at position −950 to +450, numberingrelative to the transcription initiation sequence shown in FIGS. 1A, B,C) was PCR-amplified from human genomic DNA using sequence specificprimers and cloned into the KpnI/XhoI restriction site of plasmidpGL3(Promega, E1751), and the resulting plasmid was designatedpGL3-VEGFprom (FIG. 5B).

Repression of the Luciferase Reporter Containing Native Human VEGFPromoter by Zinc Finger Protein

293 cells were transfected with luciferase reporter plasmidpGL3-VEGFprom containing native human VEGF promoter (−950 to +450 fromthe transcription initiation site) and 30 ng of pLFD-F121-KRAB orpLFD-F109-KRAB. Luciferase activity was measured as described. Foldrepression values were calculated by normalizing the firefly luciferaseactivity against the renilla luciferase activity and the result wascompared with that of the control wherein 293 cells were transfectedwith the control vector pLFD and the reporter plasmid.

The plasmids encoding F121-KRAB (30 ng) and F109-KRAB (30 ng) reducedthe reporter activity 8.7 fold and 6.1 fold, respectively.

Repression of Endogenous VEGF-A mRNA Expression by ZFP-KRAB

ZFP expression plasmids were transfected into human embryonic kidney293F cells (Gibco Life Technologies). 293F cells allow for hightransfection efficiencies.

293F cells were precultured in the wells of a 24-well culture plate, ata density of 10⁵ cells/well, in 1 ml of DMEM supplemented with 10% FBSfor 24 h in a humid atmosphere containing 5% CO₂ at 37° C. The cellswere transfected with 0, 200, or 400 ng of plasmids encoding chimericzinc finger proteins of interest using a LIPOFECTAMINE PLUS™ (LifeTechnologies). The total amount of DNA was adjusted to 400 ng by addingthe parental vector as a control if less than 400 ng of the zinc fingerprotein expression vector was used. The cells were further incubated for48 hours. The total RNA was extracted from the cells with the TRIZOL®reagent (Gibco Life Technologies).

Quantification of VEGF mRNA was Carried Out by the Following Real TimeRT-PCR.

The reverse transcription reactions were performed with 4 μg of thetotal RNA using oligo-dT as the first-strand synthesis primer for mRNA,dNTP and MMLV reverse transcriptase provided in the Superscriptfirst-strand synthesis system (Gibco Life Technologies) to obtain afirst-strand cDNA. To analyze mRNA quantities, 1 μl of the first-strandcDNA thus obtained was amplified by real time PCR using VEGF-A cDNAspecific primers (Forward primer 5′-CGGGGTACCCCCTCCCAGTCACTGACTAAC-3′,SEQ ID NO:127) and (Reverse primer 5′-CCGCTCGAGTCCGGCGGTCACCCCCAAAAG-3′;SEQ ID NO:128). Since this method is sensitive to the initial amount ofRNA, the initial RNA amounts were normalized to the GAPDH mRNAquantities calculated by specific amplification using GAPDH-specificprimers. The amplification of VEGF- and GAPDH-specific cDNAs wasmonitored and analyzed in real-time with a QUANTITECT SYBR™ kit (QIAGEN,Valencia, Calif.) and ROTORGENE™ 2000 real-time cycler (Corbett, Sydney,Australia), and the cDNAs were quantified by serial dilution of thestandards included in the reactions.

Repression of VEGF-A mRNA Synthesis by Zinc Finger Proteins

The expression of endogenous VEGF-A mRNA was reduced 2.2 fold (54.5%repression, 200 ng pLFD-F435-KRAB) and 4.1 fold (75.6% repression, 400ng pLFD-F435-KRAB) relative to untreated control cells. These resultsshow a dose dependant effect.

Repression of VEGF-A Protein Production by ZFP (F435-KRAB)

In order to examine whether the repression of VEGF-A mRNA expressionresulted in the reduction of VEGF-A protein secretion, 293F cells(10⁴/96 well plate) were transfected with 0 to 200 ng of ZFP expressionplasmids (pLFD-F435-KRAB) and cultured for 72 hours. VEGF protein thataccumulated in the culture medium (100 ul each) and dilutions of arecombinant human VEGF-A protein standards were analyzed using sandwichELISA (enzyme linked immunosorbent assay), wherein the supernatant ofculture was reacted with a anti-human VEGF antibody (R&D systems;AF-293-NA), biotinylated anti-human VEGF antibody (R&D systems; BAF293),streptavidin-alkaline phosphatase. The antigen-antibody complex wasreacted with pNPP (p-Nitrophenyl phosphate) dissolved in pNPP buffer(Chemicon; ES011). VEGF-A concentrations in the samples were determinedfrom the absorbance at 405 nm which was measured with POWERWAVE™X340(Bio-TEK Instrument Inc., Winooski Vt.).

F435-KRAB reduced VEGF-A production in a dose dependant manner. When 200ng of the plasmid was used VEGF-A protein concentration was repressed3.9 fold (138 pg/ml) relative to control cells transfected with acontrol plasmid, pLFD-F435-KRAB 200 ng. See Table 14. TABLE 14 Titrationof F435-KRAB Concentration of F435-KRAB plasmid (ng) Control 25 50 100200 (200 ng) VEGF-A 420 ± 98 345 ± 50 172 ± 13 138 ± 14 536 ± 14 (pg/ml)Fold 1.3 1.6 3.1 3.9 1.0 RepressionRepression of VEGF-A Gene Induction by Hypoxic Conditions

VEGF-A gene is known as a crucial factor for inducing angiogenesis.VEGF-A activity is essential for the development and growth of manytumors. VEGF-A activity has been found to be stimulated by hypoxiacondition in cancer tissues. A high level of VEGF-A expression isfrequently observed in tumor cells.

When the medium for culturing 293F cells is treated with 100 to 800 μMof CoCl₂ for about 7 hours, a hypoxia condition is induced and VEGFproduction by cells is rapidly escalated. The following experiment wascarried out in order to examine whether the zinc finger protein caninhibit the VEFG expression in the hypoxia condition.

293F cells (10⁴ cells/well, 96-well plate) were transfected withpLFD-F435-KRAB 50 ng and incubated for 48 hours. In order to induce thehypoxic condition, 800 μM of CoCl₂ was added to the medium at the last 7hours stage of the culture. The amount of VEGF-A secreted in the culturemedium was determined by ELISA.

VEGF production from the hypoxic CoCl₂ treated culture withmock-transfected cells increased to about 1,039 pg/ml, in contrast toabout 273 pg/ml in the untreated control cells. This observationconfirms that hypoxia strongly induces VEGF-A production. However, cellstransfected with pLFD-F435-KRAB did not induce VEGF-A production inhypoxic conditions. These cells produced only about 272 pg/ml of VEGF-A,a concentration similar to the non-hypoxic control. This resultsdemonstrates that expression of F435-KRAB inhibits VEGF-A productionunder hypoxic conditions. Since the transfection rate was only about85-90%, it is possible that the residual level of VEGF-A production isdue to the untransfected cells in the culture. We concluded thatF435-KRAB and similarly functional chimeric zinc finger proteins arepotent repressors of VEGF-A expression.

The selected zinc finger proteins or related proteins that includedomains with the same motifs may be used, e.g., as therapeutic agents.Such agents can be, e.g., to repress VEGF-A expression and therebyretard the growth of tumor cells.

A number of embodiments of the invention have been described.Nevertheless, it will be understood that various modifications may bemade without departing from the spirit and scope of the invention.Accordingly, other embodiments are within the scope of the followingclaims.

1. A polypeptide comprising a DNA binding domain that includes aplurality of zinc finger domains, wherein the DNA binding domain canbind to a site in a VEGF gene, and at least two of the zinc fingerdomains each include respective zinc finger domain motifs listed incolumn 2 of Table 1, Table 2, Table 3, Table 4 or Table
 5. 2. Thepolypeptide of claim 1, wherein the zinc finger domains each includerespective zinc finger domain motifs listed in column 2 of Table 1 orTable
 3. 3. The polypeptide of claim 2, wherein the zinc finger domainis selected from the group consisting of the zinc finger domains listedin column 3 of Table 1 or Table
 3. 4. The polypeptide of claim 3,wherein the DNA binding domain includes, in N-terminal to C-terminalorder, first, second and third zinc finger domains, wherein (1) the DNAcontacting residues at positions −1, 2, 3, and 6 of first zinc fingerdomain are QSHR, those of the second zinc finger domain are RDHT, andthose of the third zinc finger domain are RSX₁R, wherein X₁ is H or N;(2) the DNA contacting residues at positions −1, 2, 3, and 6 of firstzinc finger domain are QSHX₂, those of the second zinc finger domain areRX₃HR, and those of the third zinc finger domain are RDHT, wherein X₂ isT or V and X₃ is S or D; (3) the DNA contacting residues at positions−1, 2, 3, and 6 of first zinc finger domain are RSHR, those of thesecond zinc finger domain are RDHT, and those of the third zinc fingerdomain are VSNV; (4) the DNA contacting residues at positions −1, 2, 3,and 6 of first zinc finger domain are RDER, those of the second zincfinger domain are QSSR, and those of the third zinc finger domain areQSHT; (5) the DNA contacting residues at positions −1, 2, 3, and 6 offirst zinc finger domain are QSSR, those of the second zinc fingerdomain are QSHT, and those of the third zinc finger domain are RSNR; (6)the DNA contacting residues at positions −1, 2, 3, and 6 of first zincfinger domain are QSNR, those of the second zinc finger domain are QSHR,and those of the third zinc finger domain are RDHT; (7) the DNAcontacting residues at positions −1, 2, 3, and 6 of first zinc fingerdomain are QSHR, those of the second zinc finger domain are RDHT, andthose of the third zinc finger domain are RSNR; (8) the DNA contactingresidues at positions −1, 2, 3, and 6 of first zinc finger domain areRSHR, those of the second zinc finger domain are QSHT, and those of thethird zinc finger domain are RSHR; (9) the DNA contacting residues atpositions −1, 2, 3, and 6 of first zinc finger domain are QSHT, those ofthe second zinc finger domain are RSHR, and those of the third zincfinger domain are RDER; (10) the DNA contacting residues at positions−1, 2, 3, and 6 of first zinc finger domain are QSNR, those of thesecond zinc finger domain are RSHR, and those of the third zinc fingerdomain are QSSR; (11) the DNA contacting residues at positions −1, 2, 3,and 6 of first zinc finger domain are RSHR, those of the second zincfinger domain are QSSR, and those of the third zinc finger domain areRSHR; (12) the DNA contacting residues at positions −1, 2, 3, and 6 offirst zinc finger domain are QSHT, those of the second zinc fingerdomain are WSNR, and those of the third zinc finger domain are RSHR; or(13) the DNA contacting residues at positions −1, 2, 3, and 6 of firstzinc finger domain are WSNR, those of the second zinc finger domain areRSHR, and those of the third zinc finger domain are WSNR.
 5. Thepolypeptide of claim 2, wherein the zinc finger domain is selected fromthe group consisting of the zinc finger domains listed in column 3 ofTable 2, Table 4 or Table
 5. 6. The polypeptide of claim 1, wherein theVEGF gene is the human VEGF-A gene.
 7. The polypeptide of claim 1, whichregulates the VEGF gene expression.
 8. The polypeptide of claim 1,wherein the polypeptide further comprises a transcription activationdomain, a transcription repression domain, or a protein transductiondomain.
 9. The polypeptide of claim 8, wherein the transcriptionactivation domain comprises p65 or VP16 activation domain.
 10. Thepolypeptide of claim 8, wherein the transcription repression domaincomprises Kid or KOX repression domain.
 11. The polypeptide of claim 8,wherein the protein transduction domain is a part of TAT protein, VP22protein, or Antennapedia homeodomain.
 12. A nucleic acid that comprisesa sequence encoding the polypeptide of claim
 1. 13. The nucleic acid ofclaim 12, which comprises a sequence encoding the polypeptide of claim8.
 14. A modified mammalian cell that contains the polypeptide ofclaim
 1. 15. The cell of claim 14, wherein the polypeptide is producedfrom a nucleic acid of claim 14 in the cell.
 16. A pharmaceuticalcomposition for preventing or treating a neoplastic disorder, aninflammatory disorder, or an angiogenesis-based disorder, whichcomprises the polypeptide of claim 1, the nucleic acid of claim 12 orthe modified mammalian cell of claim 14, and a pharmaceuticallyacceptable carrier.
 17. The pharmaceutical composition of claim 16,wherein the neoplastic disorder is a cancer.
 18. The pharmaceuticalcomposition of claim 16, wherein the zinc finger domain included in thepolypeptide is selected from the group consisting of the zinc fingerdomains listed in column 3 of Table 1 or Table
 3. 19. The pharmaceuticalcomposition of claim 18, wherein the polypeptide comprises a DNA bindingdomain that includes, in N-terminal to C-terminal order, first, secondand third zinc finger domains, wherein (1) the DNA contacting residuesat positions −1, 2, 3, and 6 of first zinc finger domain are QSHR, thoseof the second zinc finger domain are RDHT, and those of the third zincfinger domain are RSX₁R, wherein X₁ is H or N; (2) the DNA contactingresidues at positions −1, 2, 3, and 6 of first zinc finger domain areQSHX₂, those of the second zinc finger domain are RX₃HR, and those ofthe third zinc finger domain are RDHT, wherein X₂ is T or V and X₃ is Sor D; (3) the DNA contacting residues at positions −1, 2, 3, and 6 offirst zinc finger domain are RSHR, those of the second zinc fingerdomain are RDHT, and those of the third zinc finger domain are VSNV; (4)the DNA contacting residues at positions −1, 2, 3, and 6 of first zincfinger domain are RDER, those of the second zinc finger domain are QSSR,and those of the third zinc finger domain are QSHT; (5) the DNAcontacting residues at positions −1, 2, 3, and 6 of first zinc fingerdomain are QSSR, those of the second zinc finger domain are QSHT, andthose of the third zinc finger domain are RSNR; (6) the DNA contactingresidues at positions −1, 2, 3, and 6 of first zinc finger domain areQSNR, those of the second zinc finger domain are QSHR, and those of thethird zinc finger domain are RDHT; (7) the DNA contacting residues atpositions −1, 2, 3, and 6 of first zinc finger domain are QSHR, those ofthe second zinc finger domain are RDHT, and those of the third zincfinger domain are RSNR; (8) the DNA contacting residues at positions −1,2, 3, and 6 of first zinc finger domain are RSHR, those of the secondzinc finger domain are QSHT, and those of the third zinc finger domainare RSHR; (9) the DNA contacting residues at positions −1, 2, 3, and 6of first zinc finger domain are QSHT, those of the second zinc fingerdomain are RSHR, and those of the third zinc finger domain are RDER;(10) the DNA contacting residues at positions −1, 2, 3, and 6 of firstzinc finger domain are QSNR, those of the second zinc finger domain areRSHR, and those of the third zinc finger domain are QSSR; (11) the DNAcontacting residues at positions −1, 2, 3, and 6 of first zinc fingerdomain are RSHR, those of the second zinc finger domain are QSSR, andthose of the third zinc finger domain are RSHR; (12) the DNA contactingresidues at positions −1, 2, 3, and 6 of first zinc finger domain areQSHT, those of the second zinc finger domain are WSNR, and those of thethird zinc finger domain are RSHR; or (13) the DNA contacting residuesat positions −1, 2, 3, and 6 of first zinc finger domain are WSNR, thoseof the second zinc finger domain are RSHR, and those of the third zincfinger domain are WSNR.
 20. An encapsulated composition comprising anencapsulation layer composed of a biocompatible material that ispermeable to proteins having a molecular weight of at least 10 kDa, andrecombinant mammalian cells, wherein the cells contain a nucleic acidcomprising a sequence encoding a chimeric zinc finger protein thatregulates production of a secreted factor.
 21. The encapsulatedcomposition of claim 20, wherein the secreted factor is insulin, aninsulin-like growth factor, VEGF, HGF, interferon, interleukin, or afibroblast growth factor.
 22. A method of regulating VEGF geneexpression, which comprises introducing the polypeptide of claim 1 orthe nucleic acid of claim 12 into a cell.
 23. The method of claim 22,wherein the polypeptide comprises a transcription activation domain, andVEGF gene expression is increased in the cell.
 24. The method of claim22, wherein the polypeptide comprises a transcription repression domain,and VEGF gene expression is decreased in the cell.
 25. The method ofclaim 22, wherein the VEGF gene is human VEGF-A gene.
 26. The method ofclaim 22, wherein the cell is a mammalian cell.
 27. The method of claim26, wherein the cell is a human cell.
 28. A method of modulatingangiogenesis in a subject, which comprises administering the compositionof claim 16 to the subject in an amount effective to reduce angiogenesisin the subject.
 29. The method of claim 28, wherein the subject is ahuman that has or is suspected of having a cancer.